Joint Coordinated Beamforming and Power Splitting Ratio Optimization in MU-MISO SWIPT-Enabled HetNets: A Multi-Agent DDQN-Based Approach

被引:61
作者
Zhang, Ruichen [1 ,2 ]
Xiong, Ke [1 ,2 ]
Lu, Yang [1 ,2 ]
Gao, Bo [1 ,2 ]
Fan, Pingyi [3 ,4 ]
Ben Letaief, Khaled [5 ,6 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Traff Data Anal & Min, Beijing 100044, Peoples R China
[3] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Beijing 100084, Peoples R China
[4] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[5] Hong Kong Univ Sci & Technol HKUST, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[6] Peng Cheng Lab, Shenzhen 518055, Guangdong, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Array signal processing; Quality of service; Optimization; Wireless communication; Macrocell networks; Femtocells; Interference; Femtocell HetNets; MU-MISO; SWIPT; reinforcement learning; multi-agent double deep Q network; USER ASSOCIATION; CELLULAR NETWORKS; WIRELESS INFORMATION; RESOURCE-ALLOCATION; SECURE SWIPT; ENERGY; SYSTEMS;
D O I
10.1109/JSAC.2021.3118397
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a multi-agent double deep Q network (DDQN)-based approach to jointly optimize the beamforming vectors and power splitting (PS) ratio in multi-user multiple-input single-output (MU-MISO) simultaneous wireless information and power transfer (SWIPT)-enabled heterogeneous networks (HetNets), where a macro base station (MBS) and several femto base stations (FBSs) serve multiple macro user equipments (MUEs) and femto user equipments (FUEs). The PS receiver architecture is deployed at FUEs. An optimization problem is formulated to maximize the achievable sum information rate of FUEs under the constraints of the achievable information rate requirements of MUEs and FUEs and the energy harvesting (EH) requirements of FUEs. Since the optimization problem is challenging to handle due to the high dimension and time-varying environment, an efficient multi-agent DDQN-based algorithm is presented, which is trained in a centralized manner and runs in a distributed manner, where two sets of deep neural network parameters are jointly updated and trained to tackle the problem and avoid overestimation. To facilitate the presented multi-agent DDQN-based algorithm, the action space, the state space and the reward function are designed, where the codebook matrix is employed to deal with the complex transmit beamforming vectors. Simulation results validate the proposed algorithm. Notable performance gains are achieved by the proposed algorithm due to considering the beam directions in the action space and the adaptability to the Doppler frequency shifts. Besides, the proposed algorithm is shown to be superior to other benchmark ones numerically.
引用
收藏
页码:677 / 693
页数:17
相关论文
共 56 条
[1]   Reinforcement Learning for Self Organization and Power Control of Two-Tier Heterogeneous Networks [J].
Amiri, Roohollah ;
Almasi, Mojtaba Ahmadi ;
Andrews, Jeffrey G. ;
Mehrpouyan, Hani .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (08) :3933-3947
[2]   Seven Ways that HetNets Are a Cellular Paradigm Shift [J].
Andrews, Jeffrey G. .
IEEE COMMUNICATIONS MAGAZINE, 2013, 51 (03) :136-144
[3]   A Survey on Device-to-Device Communication in Cellular Networks [J].
Asadi, Arash ;
Wang, Qing ;
Mancuso, Vincenzo .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (04) :1801-1819
[4]  
Bennis M., 2010, 2010 IEEE Globecom Workshops (GC'10), P706, DOI 10.1109/GLOCOMW.2010.5700414
[5]   Coordination Mechanisms for Self-Organizing Femtocells in Two-Tier Coexistence Scenarios [J].
de Lima, Carlos H. M. ;
Bennis, Mehdi ;
Latva-aho, Matti .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2012, 11 (06) :2212-2223
[6]   Application of Smart Antenna Technologies in Simultaneous Wireless Information and Power Transfer [J].
Ding, Zhiguo ;
Zhong, Caijun ;
Ng, Derrick Wing Kwan ;
Peng, Mugen ;
Suraweera, Himal A. ;
Schober, Robert ;
Poor, H. Vincent .
IEEE COMMUNICATIONS MAGAZINE, 2015, 53 (04) :86-93
[7]   Power Allocation Strategies in Energy Harvesting Wireless Cooperative Networks [J].
Ding, Zhiguo ;
Perlaza, Samir M. ;
Esnaola, Inaki ;
Poor, H. Vincent .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2014, 13 (02) :846-860
[8]  
Evolved Universal Terrestrial Radio Access (E-UTRA)
[9]  
Radio Resource Control (RRC)
[10]  
Protocol Specification, 2017, DOCUMENT TS 36331 VE