Scheduling Algorithms for Minimizing Age of Information in Wireless Broadcast Networks with Random Arrivals

被引:134
作者
Hsu, Yu-Pin [1 ]
Modiano, Eytan [2 ]
Duan, Lingjie [3 ]
机构
[1] Natl Taipei Univ, Dept Commun Engn, New Taipei 23741, Taiwan
[2] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA
[3] Singapore Univ Technol & Design, Engn Syst & Design Pillar, Singapore, Singapore
关键词
Age of information; scheduling algorithms; Markov decision processes; MARKOV DECISION-PROCESSES; AVERAGE; POLICIES;
D O I
10.1109/TMC.2019.2936199
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Age of information is a new network performance metric that captures the freshness of information at end-users. This paper studies the age of information from a scheduling perspective. To that end, we consider a wireless broadcast network where a basestation (BS) is updating many users on random information arrivals under a transmission capacity constraint. For the offline case when the arrival statistics are known to the BS, we develop a structural MDP scheduling algorithm and an index scheduling algorithm, leveraging Markov decision process (MDP) techniques and the Whittle's methodology for restless bandits. By exploring optimal structural results, we not only reduce the computational complexity of the MDP-based algorithm, but also simplify deriving a closed form of the Whittle index. Moreover, for the online case, we develop an MDP-based online scheduling algorithm and an index-based online scheduling algorithm. Both the structural MDP scheduling algorithm and the MDP-based online scheduling algorithm asymptotically minimize the average age, while the index scheduling algorithm minimizes the average age when the information arrival rates for all users are the same. Finally, the algorithms are validated via extensive numerical studies.
引用
收藏
页码:2903 / 2915
页数:13
相关论文
共 35 条
[1]   Learning algorithms or Markov decision processes with average cost [J].
Abounadi, J ;
Bertsekas, D ;
Borkar, VS .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2001, 40 (03) :681-698
[2]  
[Anonymous], 2012, DYNAMIC PROGRAMMING
[3]  
[Anonymous], 2011, Multi-armed bandit allocation indices
[4]  
Bacinoglu BT, 2015, 2015 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), P25, DOI 10.1109/ITA.2015.7308962
[5]  
Bertsekas D., 2012, Dynamic Programming and Optimal Control, V4
[6]  
Borkar V. S., 2009, Stochastic Approximation: A Dynamical Systems Viewpoint, V48
[7]   The ODE method for convergence of stochastic approximation and reinforcement learning [J].
Borkar, VS ;
Meyn, SP .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2000, 38 (02) :447-469
[8]   A COUNTEREXAMPLE ON THE OPTIMALITY EQUATION IN MARKOV DECISION CHAINS WITH THE AVERAGE COST CRITERION [J].
CAVAZOSCADENA, R .
SYSTEMS & CONTROL LETTERS, 1991, 16 (05) :387-392
[9]  
Cho J, 2000, SIGMOD REC, V29, P117, DOI 10.1145/335191.335391
[10]   On the Age of Information in Status Update Systems With Packet Management [J].
Costa, Maice ;
Codreanu, Marian ;
Ephremides, Anthony .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (04) :1897-1910