Joint spectrum and power allocation scheme based on value decomposition networks in D2D communication networks

Cited by: 1
Authors
Huang, Zhongwei [1]
Li, Tong [1]
Song, Chenghao [1]
Li, Zhenxing [2]
Wang, Jie [2]
Liu, Xiao [1]
Chen, Haibo [1]
Zhao, Xiaorong [1]
Cao, Yewen [1]
Affiliations
[1] Shandong Univ, Sch Informat Sci & Engn, Qingdao 266237, Shandong, Peoples R China
[2] Xidian Univ, China Res Inst Radiowave Propagat, Qingdao 266075, Shandong, Peoples R China
Keywords
Device-to-device (D2D) communication; Deep reinforcement learning; Resource allocation; RESOURCE-ALLOCATION; SELECTION;
DOI
10.1186/s13638-024-02393-1
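Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;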
Abstract
Device-to-device (D2D) communication allows short-range devices to reuse cellular-licensed spectrum and establish direct local connections, supporting a very large number of terminal connections and higher system throughput. However, spectrum sharing also introduces serious interference into the network. A reliable and efficient resource allocation strategy is therefore important for mitigating interference and improving system spectral efficiency. In this paper, we investigate spectrum access and power allocation in D2D communications underlaying cellular networks using deep reinforcement learning, with the aim of finding a feasible resource allocation strategy that maximizes data rate and system fairness. We propose a value decomposition network-based resource allocation scheme for D2D communication networks. Through centralized training, the proposed scheme avoids frequent information exchange among D2D users while still allowing them to make distributed joint resource allocation decisions. Simulation results show that the proposed scheme converges stably, scales well, and effectively improves system capacity.
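The abstract's combination of centralized training and distributed decisions rests on the additive structure of a value decomposition network: the joint action value is modeled as the sum of per-agent values, so each D2D user can act greedily on its own value estimates. The following is a minimal sketch of that idea only, not the paper's implementation. The action set (channel, power level), the constants, and all function names are hypothetical, and plain lists of numbers stand in for the per-agent neural networks.

```python
import itertools

# Hypothetical action space: each D2D user picks one (channel, power level) pair.
NUM_CHANNELS = 2
POWER_LEVELS_DBM = [5, 23]
ACTIONS = list(itertools.product(range(NUM_CHANNELS), POWER_LEVELS_DBM))


def greedy_action(q_values):
    """Return the index of the largest value in one agent's Q-value list."""
    return max(range(len(q_values)), key=q_values.__getitem__)


def joint_q(per_agent_q, joint_action):
    """Value decomposition: Q_tot is the sum of each agent's chosen Q_i."""
    return sum(q[a] for q, a in zip(per_agent_q, joint_action))


def decentralized_greedy(per_agent_q):
    """Each agent maximizes its own Q_i, with no information exchange."""
    return tuple(greedy_action(q) for q in per_agent_q)


def centralized_greedy(per_agent_q):
    """Brute-force search over all joint actions, for comparison only."""
    num_actions = len(per_agent_q[0])
    candidates = itertools.product(range(num_actions), repeat=len(per_agent_q))
    return max(candidates, key=lambda ja: joint_q(per_agent_q, ja))


def td_error(per_agent_q, joint_action, reward, next_per_agent_q, gamma=0.9):
    """Centralized training signal: one shared reward, one TD error on Q_tot."""
    next_action = decentralized_greedy(next_per_agent_q)
    target = reward + gamma * joint_q(next_per_agent_q, next_action)
    return target - joint_q(per_agent_q, joint_action)
```

Because the sum is maximized term by term, the per-agent greedy choices coincide with the joint maximizer whenever the maxima are unique, which is why no exchange of information is needed at decision time in this sketch. During training, the single TD error on the summed value would be backpropagated into every agent's network.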
Pages: 16