A Turbo Q-Learning (TQL) for Energy Efficiency Optimization in Heterogeneous Networks

被引:4
作者
Wang, Xiumin [1 ]
Li, Lei [1 ]
Li, Jun [2 ]
Li, Zhengquan [3 ]
机构
[1] China Jiliang Univ, Coll Informat Engn, Hangzhou 310018, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Binjiang Coll, Wuxi 214105, Peoples R China
[3] Jiangnan Univ, Coll Internet Things, Wuxi 214000, Peoples R China
基金
中国国家自然科学基金;
关键词
energy efficiency; HetNets; eICIC; Q-Learning; reinforcement learning; multistage decision process; RESOURCE-ALLOCATION; POWER-CONTROL;
D O I
10.3390/e22090957
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In order to maximize energy efficiency in heterogeneous networks (HetNets), a turbo Q-Learning (TQL) combined with multistage decision process and tabular Q-Learning is proposed to optimize the resource configuration. For the large dimensions of action space, the problem of energy efficiency optimization is designed as a multistage decision process in this paper, according to the resource allocation of optimization objectives, the initial problem is divided into several subproblems which are solved by tabular Q-Learning, and the traditional exponential increasing size of action space is decomposed into linear increase. By iterating the solutions of subproblems, the initial problem is solved. The simple stability analysis of the algorithm is given in this paper. As to the large dimension of state space, we use a deep neural network (DNN) to classify states where the optimization policy of novel Q-Learning is set to label samples. Thus far, the dimensions of action and state space have been solved. The simulation results show that our approach is convergent, improves the convergence speed by 60% while maintaining almost the same energy efficiency and having the characteristics of system adjustment.
引用
收藏
页数:20
相关论文
共 38 条
[1]   Reinforcement Learning for Self Organization and Power Control of Two-Tier Heterogeneous Networks [J].
Amiri, Roohollah ;
Almasi, Mojtaba Ahmadi ;
Andrews, Jeffrey G. ;
Mehrpouyan, Hani .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (08) :3933-3947
[2]   Matching Game-Based Cell Association in Multi-RAT HetNet Considering Device Requirements [J].
Anany, Mohamed ;
Elmesalawy, Mahmoud M. ;
Abd El-Haleem, Ahmed M. .
IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (06) :9774-9782
[3]  
[Anonymous], 2010, 36814 3GPP TS ETSI
[4]   Channel Access and Power Control for Energy-Efficient Delay-Aware Heterogeneous Cellular Networks for Smart Grid Communications Using Deep Reinforcement Learning [J].
Asuhaimi, Fauzun Abdullah ;
Bu, Shengrong ;
Klaine, Paulo Valente ;
Imran, Muhammad Ali .
IEEE ACCESS, 2019, 7 :133474-133484
[5]   Energy Saving and Interference Coordination in HetNets Using Dynamic Programming and CEC [J].
Ayala-Romero, Jose A. ;
Alcaraz, Juan J. ;
Vales-Alonso, Javier .
IEEE ACCESS, 2018, 6 :71110-71121
[6]   Data-Driven Configuration of Interference Coordination Parameters in HetNets [J].
Ayala-Romero, Jose A. ;
Alcaraz, Juan J. ;
Vales-Alonso, Javier .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (06) :5174-5187
[7]   Joint Subcarrier Assignment and Global Energy-Efficient Power Allocation for Energy-Harvesting Two-Tier Downlink NOMA Hetnets [J].
Baidas, Mohammed W. ;
Al-Mubarak, Mubarak ;
Alsusa, Emad ;
Awad, Mohamad Khattar .
IEEE ACCESS, 2019, 7 :163556-163577
[8]   Optimization of Handover Parameters for LTE/LTE-A in-Building Systems [J].
Castro-Hernandez, Diego ;
Paranjape, Raman .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (06) :5260-5273
[9]   Classification of User Trajectories in LTE HetNets Using Unsupervised Shapelets and Multiresolution Wavelet Decomposition [J].
Castro-Hernandez, Diego ;
Paranjape, Raman .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (09) :7934-7946
[10]  
Chai-Elsholz R, 2011, NEW MIDDLE AGES, P1