Adaptive Rate and Energy Harvesting Interval Control Based on Reinforcement Learning for SWIPT

Cited by: 26
Authors
Chun, Chang-Jae [1]
Kang, Jae-Mo [1]
Kim, Il-Min [1]
Affiliations
[1] Queens Univ, Dept Elect & Comp Engn, Kingston, ON K7L 3N6, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
MISO SWIPT; reinforcement learning; WIRELESS COMMUNICATIONS; CHANNEL; INFORMATION;
DOI
10.1109/LCOMM.2018.2876441
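Chinese Library Classification number
TN [Electronic technology, communication technology];
Discipline classification code
0809;
Abstract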
In this letter, we propose a new adaptive rate and energy harvesting interval control scheme that maximizes the throughput subject to an average energy constraint in a multiple-input single-output simultaneous wireless information and power transfer system. We consider the realistic scenario of a time-varying fading channel. To maximize the throughput while maintaining the average energy required at the receiver, we first formulate the joint optimization of the rate and the energy harvesting interval as a Markov decision process (MDP) using a regularization parameter. However, this MDP problem is difficult to solve directly because the channel transition probabilities (i.e., the model or the environment) are challenging to estimate in practical systems. Thus, we propose an adaptive rate and energy harvesting interval control algorithm based on a model-free reinforcement learning technique. Numerical results demonstrate that the proposed scheme significantly outperforms the conventional scheme.
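The abstract does not say which model-free algorithm is used, so the following is only a minimal sketch of the general idea, with tabular Q-learning swapped in as one common model-free method. Everything in it is an assumption made for illustration and not taken from the letter: the quantized channel states, the gain values, the candidate rates and harvesting fractions, the outage rule, the SNR scaling, and the reward form (throughput plus a regularization weight `LAMBDA` times harvested energy). The learner only observes rewards and next states; it never uses the channel transition probabilities.

```python
import math
import random

# All values below are hypothetical and chosen only for illustration.
N_STATES = 4                      # quantized channel-gain levels
GAINS = [0.2, 0.6, 1.0, 1.6]      # representative channel gain per level
RATES = [0.5, 1.0, 2.0]           # candidate transmission rates (bits/s/Hz)
EH_FRACTIONS = [0.2, 0.5, 0.8]    # candidate energy harvesting fractions of a slot
ACTIONS = [(r, t) for r in RATES for t in EH_FRACTIONS]
LAMBDA = 1.0                      # assumed regularization weight on harvested energy


def step_channel(state, rng):
    """Toy random-walk channel; the learner never reads these probabilities."""
    move = rng.choice([-1, 0, 1])
    return min(max(state + move, 0), N_STATES - 1)


def reward(state, action):
    """Throughput plus LAMBDA times harvested energy for one slot (assumed form)."""
    rate, eh_frac = action
    gain = GAINS[state]
    capacity = math.log2(1.0 + gain * 10.0)  # assumed SNR scaling of 10
    # A rate above capacity is treated as an outage with zero throughput.
    throughput = (1.0 - eh_frac) * rate if rate <= capacity else 0.0
    energy = eh_frac * gain
    return throughput + LAMBDA * energy


def q_learning(n_steps=20000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in range(N_STATES)]
    state = rng.randrange(N_STATES)
    for _ in range(n_steps):
        if rng.random() < epsilon:
            a = rng.randrange(len(ACTIONS))          # explore
        else:
            a = max(range(len(ACTIONS)), key=lambda i: q[state][i])  # exploit
        r = reward(state, ACTIONS[a])
        nxt = step_channel(state, rng)
        # Standard Q-learning update toward the bootstrapped target.
        q[state][a] += alpha * (r + gamma * max(q[nxt]) - q[state][a])
        state = nxt
    return q


def greedy_policy(q):
    """Return the (rate, harvesting fraction) pair with the highest Q-value per state."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda i: row[i])] for row in q]


if __name__ == "__main__":
    for level, action in enumerate(greedy_policy(q_learning())):
        print(f"channel level {level}: rate={action[0]}, EH fraction={action[1]}")
```

In a sketch like this, raising `LAMBDA` pushes the learned policy toward longer harvesting intervals and lowering it favors throughput, which is one way a regularization parameter can be tuned until an average energy requirement is met.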
Pages: 2571-2574
Number of pages: 4