D2D Resource Allocation Based on Reinforcement Learning and QoS

被引:1
作者
Kuo, Fang-Chang [1 ]
Wang, Hwang-Cheng [1 ]
Tseng, Chih-Cheng [2 ]
Wu, Jung-Shyr [3 ]
Xu, Jia-Hao [1 ]
Chang, Jieh-Ren [1 ]
机构
[1] Natl Ilan Univ, Dept Elect Engn, Yilan, Taiwan
[2] Natl Ilan Univ, Dept Elect Engn, Yilan, Taiwan
[3] Natl Cent Univ, Dept Commun Engn, Taoyuan City, Taiwan
关键词
Device-to-device (D2D); Resource allocation; Reinforcement learning; Multi-Player Multi-Armed Bandit (MPMAB); Dynamic resource allocation; DEVICE; NETWORKS; UPLINK; LTE;
D O I
10.1007/s11036-023-02145-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Device-to-device (D2D) communications is designed to improve the overall network performance, including low latency, high data rates, and system capacity of the fifth-generation (5G) wireless networks. The system capacity can even be improved by reusing resources between D2D user equipments (DUEs) and cellular user equipments (CUEs) without causing harmful interference to the CUEs. A D2D resource allocation scheme is expected to have the characteristic that one CUE be allocated with a variable number of resource blocks (RBs), and the RBs be reused by more than one DUE. In this study, the Multi-Player Multi-Armed Bandit (MPMAB) reinforcement learning scheme is employed to model such a problem by establishing a preference matrix to facilitate greedy resource allocation. A fair resource allocation scheme is then proposed and shown to achieve fairness, prevent waste of resources, and alleviate starvation. Moreover, this scheme has better performance when there are not too many D2D pairs.
引用
收藏
页码:1076 / 1095
页数:20
相关论文
共 42 条
[1]   Next Generation 5G Wireless Networks: A Comprehensive Survey [J].
Agiwal, Mamta ;
Roy, Abhishek ;
Saxena, Navrati .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2016, 18 (03) :1617-1655
[2]  
Alpaydin E., 2020, INTRO MACHINE LEARNI
[3]  
[Anonymous], 2014, 36843 3GPP TR
[4]  
[Anonymous], 2016, 36213 3GPP TS
[5]   A Survey on Device-to-Device Communication in Cellular Networks [J].
Asadi, Arash ;
Wang, Qing ;
Mancuso, Vincenzo .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (04) :1801-1819
[6]  
contributors W., 2023, MULT BAND
[7]   LTE for Public Safety Networks [J].
Doumi, Tewfik ;
Dolan, Mike F. ;
Tatesh, Said ;
Casati, Alessio ;
Tsirtsis, George ;
Anchan, Kiran ;
Flore, Dino .
IEEE COMMUNICATIONS MAGAZINE, 2013, 51 (02) :106-112
[8]  
Ericsson L., 2011, White Pap, V14, P124
[9]   Device-to-Device Communications in Cellular Networks [J].
Feng, Daquan ;
Lu, Lu ;
Yi Yuan-Wu ;
Li, Geoffrey Ye ;
Li, Shaoqian ;
Feng, Gang .
IEEE COMMUNICATIONS MAGAZINE, 2014, 52 (04) :49-55
[10]  
Friedman Jerome., 2001, The elements of statistical learning, V1, DOI DOI 10.1007/B94608