Stochastic Game Based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networks

被引:15
|
作者
Zhang, Tiankui [1 ]
Fang, Xinyuan [1 ]
Wang, Ziduan [1 ]
Liu, Yuanwei [2 ]
Nallanathan, Arumugam [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] Queen Mary Univ London, London E1 4NS, England
基金
中国国家自然科学基金;
关键词
Games; Device-to-device communication; Stochastic processes; Cellular networks; Heuristic algorithms; Vehicle dynamics; System performance; Cache placement; device-to-device communication; edge caching; stochastic game; CELLULAR NETWORKS; EDGE; COORDINATION; POLICY;
D O I
10.1109/TVT.2021.3120292
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Edge caching has become an effective solution to cope with the challenges brought by the massive content delivery in cellular networks. In device-to-device (D2D) enabled caching cellular networks with time-varying content popularity distribution and user terminal (UT) location, we model these dynamic networks as a stochastic game to design a cooperative cache placement policy. The cache placement reward of each UT is defined as the caching incentive minus the transmission power cost for content caching and sharing. We consider the long-term cache placement reward of all UTs in this stochastic game. In an effort to solve the stochastic game problem, we propose a multi-agent cooperative alternating Q-learning (CAQL) based cache placement algorithm. The caching control unit is defined to execute the proposed CAQL, in which, the cache placement policy of each UT is alternatively updated according to the stable policy of other UTs during the learning process, until the stable cache placement policy of all the UTs in the cell is obtained. We discuss the convergence and complexity of CAQL, which obtains the stable cache placement policy with low space complexity. Simulation results show that the proposed algorithm can effectively reduce the backhaul load and the average content access delay in dynamic networks.
引用
收藏
页码:13255 / 13269
页数:15
相关论文
共 50 条
  • [1] Q-Learning based Edge Caching Optimization for D2D Enabled Hierarchical Wireless Networks
    Wang, Chenyang
    Wang, Shanjia
    Li, Ding
    Wang, Xiaofei
    Li, Xiuhua
    Leung, Victor C. M.
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS), 2018, : 55 - 63
  • [2] Multi-agent Cooperative Alternating Q-learning Caching in D2D-enabled Cellular Networks
    Fang, Xinyuan
    Zhang, Tiankui
    Liu, Yuanwei
    Zeng, Zhimin
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [3] Cooperative caching game based on social trust for D2D communication networks
    Lu Weifeng
    Zhu Mingqi
    Xu Jia
    Chen Siguang
    Yang Lijun
    Xu Jian
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2020, 33 (09)
  • [4] DYNAMIC RESOURCE ALLOCATIONS BASED ON Q-LEARNING FOR D2D COMMUNICATION IN CELLULAR NETWORKS
    Luo, Yong
    Shi, Zhiping
    Zhou, Xin
    Liu, Qiaoyan
    Yi, Qicong
    2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, : 385 - 388
  • [5] Preference-Aware Caching Based on Cooperative Game for D2D Communication Networks
    Fan, Hongmei
    Zhang, Tiankui
    Loo, Jonathan
    Liu, Dantong
    Yang, Liwei
    2018 IEEE 87TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2018,
  • [6] A power allocation algorithm based on cooperative Q-learning for multi-agent D2D communication networks
    Dou, Zheng
    Si, Guangzhen
    Lin, Yun
    Wang, Meiyu
    PHYSICAL COMMUNICATION, 2021, 47
  • [7] Performance of Resource Allocation for D2D Communications in Q-Learning Based Heterogeneous Networks
    Huang, Yung-Fa
    Tan, Tan-Hsu
    Li, Yu-Ling
    Huang, Shao-Chieh
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [8] Performance of Q-learning based resource allocation for D2D communications in heterogeneous networks
    Lee, Shu-Hung
    Shi, Xiao-Pei
    Tan, Tan-Hsu
    Lee, Yu-Lin
    Huang, Yung-Fa
    ICT EXPRESS, 2023, 9 (06): : 1032 - 1039
  • [9] Learning to Cooperate in D2D Caching Networks
    Paschos, Georgios S.
    Destounis, Apostolos
    Iosifidis, George
    2019 IEEE 20TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC 2019), 2019,
  • [10] A Game-Theoretic Approach for Optimal Distributed Cooperative Hybrid Caching in D2D Networks
    Zhang, Yuli
    Xu, Yuhua
    Wu, Qihui
    Liu, Xin
    Yao, Kailing
    Anpalagan, Alagan
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2018, 7 (03) : 324 - 327