Multi-Access Communications With Energy Harvesting: A Multi-Armed Bandit Model and the Optimality of the Myopic Policy

被引:32
作者
Blasco, Pol [1 ]
Guenduez, Deniz [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, London SW7 2AZ, England
关键词
Energy harvesting; myopic policy; multi-access; online scheduling; partially observable Markov decision process; restless multi-armed bandit problem; MULTICHANNEL OPPORTUNISTIC ACCESS; RESTLESS BANDITS;
D O I
10.1109/JSAC.2015.2391852
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A multi-access wireless network with N transmitting nodes, each equipped with an energy harvesting (EH) device and a rechargeable battery of finite capacity, is studied. At each time slot (TS) a node is operative with a certain probability, which may depend on the availability of data, or the state of its channel. The energy arrival process at each node is modelled as an independent two-state Markov process, such that, at each TS, a node either harvests one unit of energy, or none. At each TS a subset of the nodes is scheduled by the access point (AP). The scheduling policy that maximises the total throughput is studied assuming that the AP does not know the states of either the EH processes or the batteries. The problem is identified as a restless multi-armed bandit (RMAB) problem, and an upper bound on the optimal scheduling policy is found. Under certain assumptions regarding the EH processes and the battery sizes, the optimality of the myopic policy (MP) is proven. For the general case, the performance of MP is compared numerically to the upper bound.
引用
收藏
页码:585 / 597
页数:13
相关论文
共 22 条
[1]   Multi-channel Opportunistic Access: A Case of Restless Bandits with Multiple Plays [J].
Ahmad, Sahand Haji Ali ;
Liu, Mingyan .
2009 47TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, VOLS 1 AND 2, 2009, :1361-1368
[2]   Optimality of Myopic Sensing in Multichannel Opportunistic Access [J].
Ahmad, Sahand Haji Ali ;
Liu, Mingyan ;
Javidi, Tara ;
Zhao, Qing ;
Krishnamachari, Bhaskar .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (09) :4040-4050
[3]  
[Anonymous], 2012, Dynamic Programming and Optimal Control
[4]   Transmit Power Control Policies for Energy Harvesting Sensors With Retransmissions [J].
Aprem, Anup ;
Murthy, Chandra R. ;
Mehta, Neelesh B. .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (05) :895-906
[5]   Restless bandits, linear programming relaxations, and a primal-dual index heuristic [J].
Bertsimas, D ;
Niño-Mora, J .
OPERATIONS RESEARCH, 2000, 48 (01) :80-90
[6]  
Blasco P, 2013, IEEE INT SYMP INFO, P1601, DOI 10.1109/ISIT.2013.6620497
[7]   A Learning Theoretic Approach to Energy Harvesting Communication System Optimization [J].
Blasco, Pol ;
Guenduez, Deniz ;
Dohler, Mischa .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (04) :1872-1882
[8]   A General Framework for the Optimization of Energy Harvesting Communication Systems with Battery Imperfections [J].
Devillers, Bertrand ;
Guenduez, Deniz .
JOURNAL OF COMMUNICATIONS AND NETWORKS, 2012, 14 (02) :130-139
[9]   Designing Intelligent Energy Harvesting Communication Systems [J].
Guenduez, Deniz ;
Stamatiou, Kostas ;
Michelusi, Nicolo ;
Zorzi, Michele .
IEEE COMMUNICATIONS MAGAZINE, 2014, 52 (01) :210-216
[10]  
Gul O. M., 2014, IEEE WCNC, P1