Learning-based resource allocation in D2D communications with QoS and fairness considerations

被引:6
作者
Rashed, Salma Kazemi [1 ]
Shahbazian, Reza [1 ]
Ghorashi, Seyed Ali [1 ,2 ]
机构
[1] Shahid Beheshti Univ, Dept Elect Engn, Cognit Telecommun Res Grp, Tehran, Iran
[2] Shahid Beheshti Univ, Cyberspace Res Inst, Tehran, Iran
来源
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES | 2018年 / 29卷 / 01期
关键词
SELECTION; SPECTRUM; CHANNEL; NETWORKS; POLICY; MODE;
D O I
10.1002/ett.3249
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
In device-to-device (D2D) communications, D2D users establish a direct link by utilizing the cellular users' spectrum to increase the network spectral efficiency. However, due to the higher priority of cellular users, interference imposed by D2D users to cellular ones should be controlled by channel and power allocation algorithms. Due to the unknown distribution of dynamic channel parameters, learning-based resource allocation algorithms work more efficient than classic optimization methods. In this paper, the problem of the joint channel and power allocation for D2D users in realistic scenarios is formulated as an interactive learning problem, where the channel state information of selected channels is unknown to the decision center and learned during the allocation process. In order to achieve the maximum reward function by choosing an action (channel and power level) for each D2D pair, a recency-based Q-learning method is introduced to find the best channel-power for each D2D pair. The proposed method is shown to achieve logarithmic regret function asymptotically, which makes it an order optimal policy, and it converges to the stable equilibrium solution. The simulation results confirm that the proposed method achieves better responses in terms of network sum rate and fairness criterion in comparison with conventional learning methods and random allocation.
引用
收藏
页数:20
相关论文
共 39 条
  • [31] Device-to-device resource allocation in LTE-advanced networks by hybrid particle swarm optimization and genetic algorithm
    Sun, Shijie
    Kim, Kwang-Yul
    Shin, Oh-Soon
    Shin, Yoan
    [J]. PEER-TO-PEER NETWORKING AND APPLICATIONS, 2016, 9 (05) : 945 - 954
  • [32] Energy-Efficient Resource Allocation for Device-to-Device Underlay Communication
    Wang, Feiran
    Xu, Chen
    Song, Lingyang
    Han, Zhu
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (04) : 2082 - 2092
  • [33] Resource Allocation for D2D Communications Underlay in Rayleigh Fading Channels
    Wang, Li
    Tang, Huan
    Wu, Huaqing
    Stuber, Gordon L.
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (02) : 1159 - 1170
  • [34] Wen S, 2012, IEEE INT ICST C COMM
  • [35] Social-Aware Rate Based Content Sharing Mode Selection for D2D Content Sharing Scenarios
    Wu, Dan
    Zhou, Liang
    Cai, Yueming
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (11) : 2571 - 2582
  • [36] Efficiency Resource Allocation for Device-to-Device Underlay Communication Systems: A Reverse Iterative Combinatorial Auction Based Approach
    Xu, Chen
    Song, Lingyang
    Han, Zhu
    Zhao, Qun
    Wang, Xiaoli
    Cheng, Xiang
    Jiao, Bingli
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2013, 31 (09) : 348 - 358
  • [37] Interference-aware resource sharing in D2D underlaying LTE-A networks
    Xu, Shaoyi
    Kwak, Kyung Sup
    Rao, Ramesh
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2015, 26 (12): : 1306 - 1322
  • [38] Dynamic resource allocation for Device-to-Device communication underlaying cellular networks
    Xu, Yanfang
    Yin, Rui
    Han, Tao
    Yu, Guanding
    [J]. INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2014, 27 (10) : 2408 - 2425
  • [39] Mobile Device-to-Device Video Distribution: Theory and Application
    Zhou, Liang
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2016, 12 (03)