Reinforcement Learning for Energy Harvesting Point-to-Point Communications

被引:43
作者
Ortiz, Andrea [1 ]
Al-Shatri, Hussein [1 ]
Li, Xiang [2 ]
Weber, Tobias [2 ]
Klein, Anja [1 ]
机构
[1] Tech Univ Darmstadt, Commun Engn Lab, Merckstr 25, D-64283 Darmstadt, Germany
[2] Univ Rostock, Inst Commun Engn, Richard Wagner Str 31, D-18119 Rostock, Germany
来源
2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC) | 2016年
关键词
TRANSMISSION;
D O I
10.1109/ICC.2016.7511405
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Energy harvesting point-to-point communications are considered. The transmitter harvests energy from the environment and stores it in a finite battery. It is assumed that the transmitter has always data to transmit and the harvested energy is used exclusively for data transmission. As in practical scenarios prior knowledge about the energy harvesting process might not be available, we assume that at each time instant only information about the current state of the transmitter is available, i.e., harvested energy, battery level and channel coefficient. We model the scenario as a Markov decision process and we implement reinforcement learning at the transmitter to find a power allocation policy that aims at maximizing the throughput. To overcome the limitations of traditional reinforcement learning algorithms, we apply the concept of function approximation and we propose a set of binary functions to approximate the expected throughput given the state of the transmitter. Numerical results show that the performance of the proposed approach, which requires only causal knowledge of the energy harvesting process and channel coefficients, has only a small degradation compared to the optimum case which requires perfect non-causal knowledge. Additionally, the proposed approach outperforms naive policies that assume only causal knowledge at the transmitter.
引用
收藏
页数:6
相关论文
共 13 条
[1]  
[Anonymous], 2008, P 25 INT C MACHINE L
[2]   A Learning Theoretic Approach to Energy Harvesting Communication System Optimization [J].
Blasco, Pol ;
Guenduez, Deniz ;
Dohler, Mischa .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (04) :1872-1882
[3]  
Gordon GJ, 2001, ADV NEUR IN, V13, P1040
[4]   Designing Intelligent Energy Harvesting Communication Systems [J].
Guenduez, Deniz ;
Stamatiou, Kostas ;
Michelusi, Nicolo ;
Zorzi, Michele .
IEEE COMMUNICATIONS MAGAZINE, 2014, 52 (01) :210-216
[5]   A Generic Model for Optimizing Single-Hop Transmission Policy of Replenishable Sensors [J].
Lei, Jing ;
Yates, Roy ;
Greenstein, Larry .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2009, 8 (02) :547-551
[6]   Optimal Save-Then-Transmit Protocol for Energy Harvesting Wireless Transmitters [J].
Luo, Shixin ;
Zhang, Rui ;
Lim, Teng Joon .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2013, 12 (03) :1196-1207
[7]  
Orhan O, 2012, 2012 IEEE INFORMATION THEORY WORKSHOP (ITW), P84, DOI 10.1109/ITW.2012.6404770
[8]   Transmission with Energy Harvesting Nodes in Fading Wireless Channels: Optimal Policies [J].
Ozel, Omur ;
Tutuncuoglu, Kaya ;
Yang, Jing ;
Ulukus, Sennur ;
Yener, Aylin .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2011, 29 (08) :1732-1743
[9]  
Russell S., 2010, Artificial Intelligence-A modern Approach
[10]  
Sutton R. S., 1198, REINFORCEMENT LEARNI