A Reinforcement Learning Framework for Optimizing Age of Information in RF-Powered Communication Systems

被引:108
作者
Abd-Elmagid, Mohamed A. [1 ]
Dhillon, Harpreet S. [1 ]
Pappas, Nikolaos [2 ]
机构
[1] Virginia Tech, Dept ECE, Wireless VT, Blacksburg, VA 24061 USA
[2] Linkoping Univ, Dept Sci & Technol, SE-60174 Norrkoping, Sweden
关键词
Batteries; Energy harvesting; Reinforcement learning; System analysis and design; Real-time systems; Wireless communication; Age of Information; RF energy harvesting; Markov Decision Process; MINIMIZING AGE; STATUS UPDATE; AVERAGE AGE; PEAK AGE; ENERGY; NETWORKS; INTERNET; SENSOR; TRANSMISSION; MINIMIZATION;
D O I
10.1109/TCOMM.2020.2991992
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we study a real-time monitoring system in which multiple source nodes are responsible for sending update packets to a common destination node in order to maintain the freshness of information at the destination. Since it may not always be feasible to replace or recharge batteries in all source nodes, we consider that the nodes are powered through wireless energy transfer (WET) by the destination. For this system setup, we investigate the optimal online sampling policy (referred to as the age-optimal policy) that jointly optimizes WET and scheduling of update packet transmissions with the objective of minimizing the long-term average weighted sum of Age of Information (AoI) values for different physical processes (observed by the source nodes) at the destination node, referred to as the sum-AoI. To solve this optimization problem, we first model this setup as an average cost Markov decision process (MDP) with finite state and action spaces. Due to the extreme curse of dimensionality in the state space of the formulated MDP, classical reinforcement learning algorithms are no longer applicable to our problem even for reasonable-scale settings. Motivated by this, we propose a deep reinforcement learning (DRL) algorithm that can learn the age-optimal policy in a computationally-efficient manner. We further characterize the structural properties of the age-optimal policy analytically, and demonstrate that it has a threshold-based structure with respect to the AoI values for different processes. We extend our analysis to characterize the structural properties of the policy that maximizes average throughput for our system setup, referred to as the throughput-optimal policy. Afterwards, we analytically demonstrate that the structures of the age-optimal and throughput-optimal policies are different. We also numerically demonstrate these structures as well as the impact of system design parameters on the optimal achievable average weighted sum-AoI.
引用
收藏
页码:4747 / 4760
页数:14
相关论文
共 58 条
  • [1] Abd-Elmagid M. A., 2019, PROC IEEE GLOBAL COM, P1
  • [2] On the Role of Age of Information in the Internet of Things
    Abd-Elmagid, Mohamed A.
    Pappas, Nikolaos
    Dhillon, Arpreet S.
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2019, 57 (12) : 72 - 77
  • [3] Joint Energy and SINR Coverage in Spatially Clustered RF-Powered IoT Network
    Abd-Elmagid, Mohamed A.
    Kishk, Mustafa A.
    Dhillon, Harpreet S.
    [J]. IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2019, 3 (01): : 132 - 146
  • [4] Average Peak Age-of-Information Minimization in UAV-Assisted IoT Networks
    Abd-Elmagid, Mohamed A.
    Dhillon, Harpreet S.
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (02) : 2003 - 2008
  • [5] AbdelAziz M. K., 2018, P IEEE GLOB COMM C
  • [6] [Anonymous], 2019, IEEE GLOBE WORK
  • [7] Age-Minimal Transmission for Energy Harvesting Sensors With Finite Batteries: Online Policies
    Arafa, Ahmed
    Yang, Jing
    Ulukus, Sennur
    Poor, H. Vincent
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2020, 66 (01) : 534 - 556
  • [8] Timely Updates in Energy Harvesting Two-Hop Networks: Offline and Online Policies
    Arafa, Ahmed
    Ulukus, Sennur
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (08) : 4017 - 4030
  • [9] Bacinoglu BT, 2018, IEEE INT SYMP INFO, P876, DOI 10.1109/ISIT.2018.8437573
  • [10] Bacinoglu BT, 2015, 2015 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), P25, DOI 10.1109/ITA.2015.7308962