Self-Organization in Small Cell Networks: A Reinforcement Learning Approach

被引:137
作者
Bennis, Mehdi [1 ]
Perlaza, Samir M. [2 ]
Blasco, Pol [4 ]
Han, Zhu [5 ]
Poor, H. Vincent [3 ]
机构
[1] Univ Oulu, Ctr Wireless Commun, SF-90100 Oulu, Finland
[2] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[3] Princeton Univ, Princeton, NJ 08544 USA
[4] CTTC, Barcelona, Spain
[5] Univ Houston, Elect & Comp Engn Dept, Houston, TX USA
关键词
Small cell networks; self-organizing networks; game theory; reinforcement learning; EQUILIBRIA; FEMTOCELLS; INFORMATION; GAMES;
D O I
10.1109/TWC.2013.060513.120959
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a decentralized and self-organizing mechanism for small cell networks (such as micro-, femto- and picocells) is proposed. In particular, an application to the case in which small cell networks aim to mitigate the interference caused to the macrocell network, while maximizing their own spectral efficiencies, is presented. The proposed mechanism is based on new notions of reinforcement learning (RL) through which small cells jointly estimate their time-average performance and optimize their probability distributions with which they judiciously choose their transmit configurations. Here, a minimum signal to interference plus noise ratio (SINR) is guaranteed at the macrocell user equipment (UE), while the small cells maximize their individual performances. The proposed RL procedure is fully distributed as every small cell base station requires only an observation of its instantaneous performance which can be obtained from its UE. Furthermore, it is shown that the proposed mechanism always converges to an epsilon Nash equilibrium when all small cells share the same interest. In addition, this mechanism is shown to possess better convergence properties and incur less overhead than existing techniques such as best response dynamics, fictitious play or classical RL. Finally, numerical results are given to validate the theoretical findings, highlighting the inherent tradeoffs facing small cells, namely exploration/exploitation, myopic/foresighted behavior and complete/incomplete information.
引用
收藏
页码:3202 / 3212
页数:11
相关论文
共 37 条
[21]  
Li H., P 2009 IEEE C SYST M
[22]   ENHANCED INTERCELL INTERFERENCE COORDINATION CHALLENGES IN HETEROGENEOUS NETWORKS [J].
Lopez-Perez, David ;
Guevenc, Ismail ;
de la Roche, Guillaume ;
Kountouris, Marios ;
Quek, Tony Q. S. ;
Zhang, Jie .
IEEE WIRELESS COMMUNICATIONS, 2011, 18 (03) :22-30
[23]   OFDMA Femtocells: A Roadmap on Interference Avoidance [J].
Lopez-Perez, David ;
Valcarce, Alvaro ;
de la Roche, Guillaume ;
Zhang, Jie .
IEEE COMMUNICATIONS MAGAZINE, 2009, 47 (09) :41-48
[24]   Joint Strategy Fictitious Play With Inertia for Potential Games [J].
Marden, Jason R. ;
Arslan, Guerdal ;
Shamma, Jeff S. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2009, 54 (02) :208-220
[25]  
MCFADDEN D, 1976, ANN ECON SOC MEAS, V5, P363
[26]  
McKelvey R. D., 1998, Experimental Economics, V1, P9, DOI 10.1007/BF01426213
[27]   QUANTAL RESPONSE EQUILIBRIA FOR NORMAL-FORM GAMES [J].
MCKELVEY, RD ;
PALFREY, TR .
GAMES AND ECONOMIC BEHAVIOR, 1995, 10 (01) :6-38
[28]   Potential games [J].
Monderer, D ;
Shapley, LS .
GAMES AND ECONOMIC BEHAVIOR, 1996, 14 (01) :124-143
[30]  
Nguyen K., P 2010 IEEE MULT SYS