Stochastic Game Based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networks

被引:15
|
作者
Zhang, Tiankui [1 ]
Fang, Xinyuan [1 ]
Wang, Ziduan [1 ]
Liu, Yuanwei [2 ]
Nallanathan, Arumugam [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] Queen Mary Univ London, London E1 4NS, England
基金
中国国家自然科学基金;
关键词
Games; Device-to-device communication; Stochastic processes; Cellular networks; Heuristic algorithms; Vehicle dynamics; System performance; Cache placement; device-to-device communication; edge caching; stochastic game; CELLULAR NETWORKS; EDGE; COORDINATION; POLICY;
D O I
10.1109/TVT.2021.3120292
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Edge caching has become an effective solution to cope with the challenges brought by the massive content delivery in cellular networks. In device-to-device (D2D) enabled caching cellular networks with time-varying content popularity distribution and user terminal (UT) location, we model these dynamic networks as a stochastic game to design a cooperative cache placement policy. The cache placement reward of each UT is defined as the caching incentive minus the transmission power cost for content caching and sharing. We consider the long-term cache placement reward of all UTs in this stochastic game. In an effort to solve the stochastic game problem, we propose a multi-agent cooperative alternating Q-learning (CAQL) based cache placement algorithm. The caching control unit is defined to execute the proposed CAQL, in which, the cache placement policy of each UT is alternatively updated according to the stable policy of other UTs during the learning process, until the stable cache placement policy of all the UTs in the cell is obtained. We discuss the convergence and complexity of CAQL, which obtains the stable cache placement policy with low space complexity. Simulation results show that the proposed algorithm can effectively reduce the backhaul load and the average content access delay in dynamic networks.
引用
收藏
页码:13255 / 13269
页数:15
相关论文
共 50 条
  • [41] Fundamental Limits of Caching in Wireless D2D Networks
    Ji, Mingyue
    Caire, Giuseppe
    Molisch, Andreas F.
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2016, 62 (02) : 849 - 869
  • [42] Relay selection algorithm based on social network combined with Q-learning for vehicle D2D communication
    Qian, Hongzhi
    Yu, Jinming
    Hua, Licheng
    IET COMMUNICATIONS, 2019, 13 (20) : 3582 - 3587
  • [43] Mobility-Aware Caching in D2D Networks
    Wang, Rui
    Zhang, Jun
    Song, S. H.
    Letaief, Khaled B.
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (08) : 5001 - 5015
  • [44] IBP Based Caching Strategy in D2D
    Shan, Chun
    Wu, Xiao-ping
    Liu, Yan
    Cai, Jun
    Luo, Jian-zhen
    APPLIED SCIENCES-BASEL, 2019, 9 (12):
  • [45] Multi-Agent Collaborative Caching Strategies in Dynamic Heterogeneous D2D Networks
    Fan, Xinglong
    Chen, Honglong
    Ni, Zhichen
    Li, Guoxin
    Sun, Haiyang
    Yu, Jiguo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (05) : 7204 - 7217
  • [46] A Novel Distributed Q-Learning Based Resource Reservation Framework for Facilitating D2D Content Access Requests in LTE-A Networks
    Kumar, Naveen
    Swain, Siba Narayan
    Murthy, C. Siva Ram
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2018, 15 (02): : 718 - 731
  • [47] A Delay-Aware Caching Algorithm for Wireless D2D Caching Networks
    Li, Yi
    Gursoy, M. Cenk
    Velipasalar, Senem
    2017 IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2017, : 456 - 461
  • [48] An Influence Factor Based Caching Node Selection Algorithm in D2D Networks
    Fang, Tao
    Tian, Hua
    Yang, Yang
    Liu, Xin
    Wu, Ducheng
    Chen, Xueqiang
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 805 - 809
  • [49] Edge Caching for D2D Enabled Hierarchical Wireless Networks with Deep Reinforcement Learning
    Li, Wenkai
    Wang, Chenyang
    Li, Ding
    Hu, Bin
    Wang, Xiaofei
    Ren, Jianji
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2019, 2019
  • [50] Optimal D2D Resource Allocation in Heterogeneous Cellular Networks by Decentralized Multi-Agent Deep Q-Learning
    Akhoundzadeh, Pouya
    Mirjalily, Ghasem
    Sadeghi, Mohammad Taghi
    2024 32ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, ICEE 2024, 2024, : 739 - 743