Online Learning for Energy Saving and Interference Coordination in HetNets

被引:24
作者
Ayala-Romero, Jose A. [1 ]
Alcaraz, Juan J. [1 ]
Zanella, Andrea [2 ]
Zorzi, Michele [2 ]
机构
[1] Tech Univ Cartagena, Dept Informat & Commun Technol, Cartagena 30202, Spain
[2] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
Online learning; conextual multi-armed bandit; green networks; heterogeneous networks; interference coordination; ON/OFF STRATEGIES; PARAMETERS; NETWORKS;
D O I
10.1109/JSAC.2019.2904362
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In heterogeneous cellular networks (HetNets), switching OFF small cells under low user traffic periods has been proved to be an effective energy saving strategy. However, this strategy has strong interactions with interference coordination (IC) mechanisms, making it convenient to address both tasks simultaneously. The motivation of this paper is to develop a self-optimization algorithm capable of jointly controlling energy saving and IC mechanisms using an online learning approach. Our proposal is based on a contextual bandit formulation that, among other challenges, implies discovering the most energy-efficient control actions while satisfying a predefined level of Quality of Service (QoS) for the users. We propose a two-level framework comprising a global controller, in charge of a group of macro cells, and multiple local controllers, one per macro cell. The global controller implements a novel algorithm, referred to as the Bayesian Response Estimation and Threshold Search (BRETS), that is capable of learning, for each control action, its feasibility boundaries in terms of QoS and its energy consumption as a function of the aggregated user traffic. The algorithm comes with a bound on its expected convergence time. The local controllers translate the control actions learned by the global controller into local decisions. Our numerical results show that BRETS is only 1% less efficient than an ideal oracle policy, clearly outperforming other benchmark algorithms.
引用
收藏
页码:1374 / 1388
页数:15
相关论文
共 47 条
[1]  
3GPP, 2014, 36887 3GPP TR
[2]  
Abadi Martin, 2016, arXiv
[3]   A neural networks committee for the contextual bandit problem [J].
Allesiardo, Robin ;
Féraud, Raphaël ;
Bouneffouf, Djallel .
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8834 :374-381
[4]  
[Anonymous], 2002, MACH LEARN
[5]  
[Anonymous], 2010, 36814 TR 3GPP
[6]  
[Anonymous], 2009, PROC IEEE 70 VEH TEC
[7]  
[Anonymous], 2010, R1100142 3GPP
[8]  
Auer P., 2002, Journal of Machine Learning Research, V3, P397, DOI DOI 10.4271/610369
[9]  
Ayala-Romero J. A., 2018, P IEEE INT C COMM IC, P1
[10]   Energy Saving and Interference Coordination in HetNets Using Dynamic Programming and CEC [J].
Ayala-Romero, Jose A. ;
Alcaraz, Juan J. ;
Vales-Alonso, Javier .
IEEE ACCESS, 2018, 6 :71110-71121