Self-Organization in Small Cell Networks: A Reinforcement Learning Approach

被引：137

作者：

Bennis, Mehdi ^{[1
]}

Perlaza, Samir M. ^{[2
]}

Blasco, Pol ^{[4
]}

Han, Zhu ^{[5
]}

Poor, H. Vincent ^{[3
]}

机构：

[1] Univ Oulu, Ctr Wireless Commun, SF-90100 Oulu, Finland

[2] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA

[3] Princeton Univ, Princeton, NJ 08544 USA

[4] CTTC, Barcelona, Spain

[5] Univ Houston, Elect & Comp Engn Dept, Houston, TX USA

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2013年 / 12卷 / 07期

关键词：

Small cell networks; self-organizing networks; game theory; reinforcement learning; EQUILIBRIA; FEMTOCELLS; INFORMATION; GAMES;

D O I：

10.1109/TWC.2013.060513.120959

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a decentralized and self-organizing mechanism for small cell networks (such as micro-, femto- and picocells) is proposed. In particular, an application to the case in which small cell networks aim to mitigate the interference caused to the macrocell network, while maximizing their own spectral efficiencies, is presented. The proposed mechanism is based on new notions of reinforcement learning (RL) through which small cells jointly estimate their time-average performance and optimize their probability distributions with which they judiciously choose their transmit configurations. Here, a minimum signal to interference plus noise ratio (SINR) is guaranteed at the macrocell user equipment (UE), while the small cells maximize their individual performances. The proposed RL procedure is fully distributed as every small cell base station requires only an observation of its instantaneous performance which can be obtained from its UE. Furthermore, it is shown that the proposed mechanism always converges to an epsilon Nash equilibrium when all small cells share the same interest. In addition, this mechanism is shown to possess better convergence properties and incur less overhead than existing techniques such as best response dynamics, fictitious play or classical RL. Finally, numerical results are given to validate the theoretical findings, highlighting the inherent tradeoffs facing small cells, namely exploration/exploitation, myopic/foresighted behavior and complete/incomplete information.

引用

页码：3202 / 3212

页数：11

共 37 条

[1] Femtocells: Past, Present, and Future [J].

Andrews, Jeffrey G. ;

Claussen, Holger ;

Dohler, Mischa ;

Rangan, Sundeep ;

Reed, Mark C. .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2012, 30 (03) :497-508

[2]

[Anonymous], IEEE T VEHICULAR COM

[3]

[Anonymous], 1998, THEORY LEARNING GAME

[4]

[Anonymous], NGMN REC SON O M REQ

[5]

[Anonymous], 2009, R4-092042

[6]

Bennis M., P 2010 IEEE INT WORK

[7]

Bennis M., P 2011 IEEE GLOB COM

[8]

Bennis M., P 2011 IEEE INT C CO

[9]

Borkar VS., 2009, Stochastic Approximation: A Dynamical Systems Viewpoint

[10] Femtocell Networks: A Survey [J].

Chandrasekhar, Vikram ;

Andrews, Jeffrey G. ;

Gatherer, Alan .

IEEE COMMUNICATIONS MAGAZINE, 2008, 46 (09) :59-67

← 1 2 3 4 →