On Optimal Power Control for URLLC over a Non-stationary Wireless Channel using Contextual Reinforcement Learning

Cited by: 0
Authors
Sharma, Mohit K.
Sun, Sumei
Kurniawan, Ernest
Tan, Peng Hui
Source
IEEE International Conference on Communications (ICC 2022) | 2022
Keywords
Energy minimization; non-stationary wireless channel; reinforcement learning; URLLC; low-latency communications; communication; optimization; networks; systems
DOI
10.1109/ICC45855.2022.9839177
Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
In this work, we investigate the design of energy-optimal policies for ultra-reliable low-latency communications (URLLC) over a non-stationary wireless channel, using a contextual reinforcement learning (RL) framework. We consider a point-to-point communication system over a piece-wise stationary wireless channel whose Doppler frequency switches between two distinct values, depending on the underlying state of the channel. To benchmark the performance, we first consider an oracle agent that has perfect but causal information about the switching instants and consists of two deep RL (DRL) agents, each tasked with optimal decision making in a unique partially stationary environment. Comparing the oracle agent with a conventional DRL agent reveals that the performance gain obtained using the oracle agent depends on the dynamics of the non-stationary channel. In particular, for a non-stationary channel with a faster switching rate, the oracle agent consumes approximately 15-20% less energy. In contrast, for a channel with a slower switching rate, the performance of the oracle agent is similar to that of the conventional DRL agent. Next, for the more realistic scenario in which information about the switching instants of the Doppler frequency is not available, we model the non-stationary channel as a regime-switching process modulated by a Markov process, and adapt the oracle agent by aiding it with a state tracking algorithm proposed for the regime-switching process. Our simulation results show that the proposed algorithm performs better than the conventional DRL agent.
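The two ideas the abstract combines, a channel regime driven by a two-state Markov process and an oracle that hands each decision to the agent assigned to the current regime, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the per-slot switching probability `p_switch`, and the placeholder agents are all assumptions made for illustration, and the DRL training and the state tracking algorithm are deliberately left out.

```python
import random


def simulate_regimes(n_steps, p_switch, seed=0):
    """Sample a two-state Markov regime sequence (hypothetical model).

    Starting in regime 0, the regime flips with probability p_switch
    after every slot. A larger p_switch means a faster switching rate.
    """
    rng = random.Random(seed)
    state = 0
    states = []
    for _ in range(n_steps):
        states.append(state)
        if rng.random() < p_switch:
            state = 1 - state
    return states


def agent_regime_0(observation):
    # Placeholder for the agent assigned to regime 0 (assumption).
    return "agent_0"


def agent_regime_1(observation):
    # Placeholder for the agent assigned to regime 1 (assumption).
    return "agent_1"


AGENTS = (agent_regime_0, agent_regime_1)


def oracle_act(regime, observation, agents=AGENTS):
    """Oracle dispatch: the known current regime selects the agent."""
    return agents[regime](observation)


if __name__ == "__main__":
    regimes = simulate_regimes(n_steps=10, p_switch=0.3, seed=1)
    decisions = [oracle_act(r, observation=None) for r in regimes]
    print(list(zip(regimes, decisions)))
```

When the switching instants are not observed, the `regime` argument would have to come from an estimate rather than from the true sequence, which is the role the abstract assigns to the state tracking algorithm.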
Pages: 5493-5498
Number of pages: 6
Related papers
50 in total
  • [1] Deep reinforcement learning control for non-stationary building energy management
    Naug, Avisek
    Quinones-Grueiro, Marcos
    Biswas, Gautam
    ENERGY AND BUILDINGS, 2022, 277
  • [2] Reinforcement learning algorithm for non-stationary environments
    Sindhu Padakandla
    Prabuchandran K. J.
    Shalabh Bhatnagar
    Applied Intelligence, 2020, 50 : 3590 - 3606
  • [3] Reinforcement learning algorithm for non-stationary environments
    Padakandla, Sindhu
    Prabuchandran, K. J.
    Bhatnagar, Shalabh
    APPLIED INTELLIGENCE, 2020, 50 (11) : 3590 - 3606
  • [4] Towards Reinforcement Learning for Non-stationary Environments
    Dal Toe, Sebastian Gregory
    Tiddeman, Bernard
    Mac Parthalain, Neil
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 41 - 52
  • [5] Channel Estimation for RIS Assisted Wireless Communications: Stationary or Non-Stationary?
    Chen, Yuhao
    Jian, Mengnan
    Dai, Linglong
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 3776 - 3791
  • [6] Choosing search heuristics by non-stationary reinforcement learning
    Nareyek, A
    METAHEURISTICS: COMPUTER DECISION-MAKING, 2004, 86 : 523 - +
  • [7] Learning in Non-Stationary Wireless Control Systems via Newton's Method
    Eisen, Mark
    Gatsis, Konstantinos
    Pappas, George J.
    Ribeiro, Alejandro
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 1410 - 1417
  • [8] Predictive reinforcement learning in non-stationary environments using weighted mixture policy
    Pourshamsaei, Hossein
    Nobakhti, Amin
    APPLIED SOFT COMPUTING, 2024, 153
  • [9] Towards optimal HVAC control in non-stationary building environments combining active change detection and deep reinforcement learning
    Deng, Xiangtian
    Zhang, Yi
    Qi, He
    BUILDING AND ENVIRONMENT, 2022, 211
  • [10] Accelerated Variant of Reinforcement Learning Algorithms for Light Control with Non-stationary User Behaviour
    Haddam, Nassim
    Boulakia, Benjamin Cohen
    Barth, Dominique
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON SMART CITIES AND GREEN ICT SYSTEMS (SMARTGREENS), 2022, : 78 - 85