Efficient exploration in reinforcement learning-based cognitive radio spectrum sharing

被引:41
作者
Jiang, T. [1 ]
Grace, D. [1 ]
Mitchell, P. D. [1 ]
机构
[1] Univ York, Dept Elect, Commun Res Grp, York YO10 5DD, N Yorkshire, England
关键词
CHANNEL ASSIGNMENT; POWER-CONTROL;
D O I
10.1049/iet-com.2010.0258
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This study introduces two novel approaches, pre-partitioning and weight-driven exploration, to enable an efficient learning process in the context of cognitive radio. Learning efficiency is crucial when applying reinforcement learning to cognitive radio since cognitive radio users will cause a higher level of disturbance in the exploration phase. Careful control of the tradeoff between exploration and exploitation for a learning-enabled cognitive radio in order to efficiently learn from the interactions with a dynamic radio environment is investigated. In the pre-partitioning scheme, the potential action space of cognitive radios is reduced by initially randomly partitioning the spectrum in each cognitive radio. Cognitive radios are therefore able to finish their exploration stage faster than more basic reinforcement learning-based schemes. In the weight-driven exploration scheme, exploitation is merged into exploration by taking into account the knowledge gained in exploration to influence action selection, thereby achieving a more efficient exploration phase. The learning efficiency in a cognitive radio scenario is defined and the learning efficiency of the proposed schemes is investigated. The simulation results show that the exploration of cognitive radio is more efficient by using pre-partitioning and weight-driven exploration and the system performance is improved accordingly.
引用
收藏
页码:1309 / 1317
页数:9
相关论文
共 29 条
[1]   NeXt generation/dynamic spectrum access/cognitive radio wireless networks: A survey [J].
Akyildiz, Ian F. ;
Lee, Won-Yeol ;
Vuran, Mehmet C. ;
Mohanty, Shantidev .
COMPUTER NETWORKS, 2006, 50 (13) :2127-2159
[2]  
[Anonymous], 1975, Queueing Systems
[3]  
[Anonymous], 14 IST MOB WIR COMM
[4]  
[Anonymous], 2000, COGNITIVE RADIO INTE
[5]  
BUBLIN M, 2008, 5 KARLSR WORKSH SOFT
[6]  
CHEN T, 2008, IEEE ICC IEEE COCONE
[7]   Applications of machine learning to cognitive radio networks [J].
Clancy, Charles ;
Hecker, Joe ;
Stuntebeck, Erich ;
O'Shea, Tim .
IEEE WIRELESS COMMUNICATIONS, 2007, 14 (04) :47-52
[8]  
CORDEIRO C, 2005, 80222 IEEE DYSPAN
[9]  
DASILVA L, 2007, COGNITIVE NETWORKS T
[10]  
Fette B., 2006, COGNITIVE RADIO TECH