Stochastic Game Based Cooperative Alternating Q-Learning Caching in Dynamic D2D Networks

Cited by: 15
Authors
Zhang, Tiankui [1]
Fang, Xinyuan [1]
Wang, Ziduan [1]
Liu, Yuanwei [2]
Nallanathan, Arumugam [2]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] Queen Mary Univ London, London E1 4NS, England
Funding
National Natural Science Foundation of China
Keywords
Games; Device-to-device communication; Stochastic processes; Cellular networks; Heuristic algorithms; Vehicle dynamics; System performance; Cache placement; device-to-device communication; edge caching; stochastic game; CELLULAR NETWORKS; EDGE; COORDINATION; POLICY;
DOI
10.1109/TVT.2021.3120292
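Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification code
0808; 0809
Abstract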
Edge caching has become an effective solution to the challenges posed by massive content delivery in cellular networks. In device-to-device (D2D) enabled caching cellular networks with time-varying content popularity distribution and user terminal (UT) location, we model these dynamic networks as a stochastic game to design a cooperative cache placement policy. The cache placement reward of each UT is defined as the caching incentive minus the transmission power cost for content caching and sharing. We consider the long-term cache placement reward of all UTs in this stochastic game. To solve the stochastic game problem, we propose a multi-agent cooperative alternating Q-learning (CAQL) based cache placement algorithm. A caching control unit is defined to execute the proposed CAQL, in which the cache placement policy of each UT is updated alternately, given the stable policies of the other UTs during the learning process, until a stable cache placement policy is obtained for all the UTs in the cell. We discuss the convergence and complexity of CAQL, which obtains the stable cache placement policy with low space complexity. Simulation results show that the proposed algorithm can effectively reduce the backhaul load and the average content access delay in dynamic networks.
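The alternating update described in the abstract can be illustrated with a toy sketch. This is not the paper's CAQL algorithm: it assumes a single-state (stateless) setting, so the discounted next-state term of Q-learning is dropped, and every name and number in it (`POPULARITY`, `POWER_COST`, `reward`, `total_reward`, `caql_toy`, the incentive-sharing rule) is hypothetical and chosen only for illustration. It shows one UT at a time learning Q-values for its caching choice while the other UTs' policies are held fixed, repeating until no policy changes.

```python
import random

# Hypothetical toy parameters (not taken from the paper).
POPULARITY = [0.4, 0.3, 0.2, 0.1]  # assumed request probability per content
POWER_COST = 0.05                  # assumed transmission power cost per UT


def reward(ut, actions):
    """Toy reward of one UT: caching incentive minus transmission power cost.

    Assumption: the incentive for a content is split evenly among the UTs
    that cache the same content, which favours diverse placement.
    """
    sharers = sum(1 for a in actions if a == actions[ut])
    return POPULARITY[actions[ut]] / sharers - POWER_COST


def total_reward(actions):
    """Cooperative objective: sum of the rewards of all UTs."""
    return sum(reward(u, actions) for u in range(len(actions)))


def caql_toy(n_uts=3, rounds=20, episodes=2000, alpha=0.2, epsilon=0.2, seed=0):
    """Alternating Q-learning sketch: update one UT while the others stay fixed."""
    rng = random.Random(seed)
    n_contents = len(POPULARITY)
    q = [[0.0] * n_contents for _ in range(n_uts)]
    policy = [0] * n_uts  # every UT initially caches content 0
    for _ in range(rounds):
        changed = False
        for ut in range(n_uts):
            for _ in range(episodes):
                # Epsilon-greedy choice of the content this UT caches.
                if rng.random() < epsilon:
                    a = rng.randrange(n_contents)
                else:
                    a = max(range(n_contents), key=lambda c: q[ut][c])
                actions = policy.copy()
                actions[ut] = a
                r = total_reward(actions)
                # Stateless Q-update (no next-state term in this toy setting).
                q[ut][a] += alpha * (r - q[ut][a])
            best = max(range(n_contents), key=lambda c: q[ut][c])
            if best != policy[ut]:
                policy[ut] = best
                changed = True
        if not changed:  # no UT changed its policy in a full round: stable
            break
    return policy, q


if __name__ == "__main__":
    policy, _ = caql_toy()
    print("stable placement:", policy, "total reward:", total_reward(policy))
```

Holding the other UTs fixed during each learning phase keeps one Q-table per UT over its own actions rather than a joint table over all action combinations, which is one plausible reading of the low space complexity the abstract mentions.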
Pages: 13255 - 13269
Number of pages: 15
Related papers
50 in total
  • [21] Caching strategy based on transmission delay for D2D cooperative edge caching system
    Cai, Yan
    Wu, Fan
    Zhu, Hongbo
    Tongxin Xuebao/Journal on Communications, 2021, 42 (03) : 183 - 189
  • [22] A Deep-Reinforcement-Learning-Based Social-Aware Cooperative Caching Scheme in D2D Communication Networks
    Bai, Yalu
    Wang, Dan
    Huang, Gang
    Song, Bin
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (11) : 9634 - 9645
  • [23] RESOURCE ALLOCATION FOR D2D COMMUNICATIONS WITH A NOVEL DISTRIBUTED Q-LEARNING ALGORITHM IN HETEROGENEOUS NETWORKS
    Huang, Yung-Fa
    Tan, Tan-Hsu
    Wang, Neng-Chung
    Chen, Young-Long
    Li, Yu-Ling
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 533 - 537
  • [24] Local cooperative caching policies in multi-hop D2D networks
    Iqbal, Javed
    Giaccone, Paolo
    Rossi, Claudio
    2014 IEEE 10TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB), 2014, : 245 - 250
  • [25] Caching Incentive Design in Wireless D2D Networks: A Stackelberg Game Approach
    Chen, Zhuoqun
    Liu, Yangyang
    Zhou, Bo
    Tao, Meixia
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016
  • [26] An adaptive Q-learning Approach to Power Control for D2D communications
    Toumi, Salwa
    Hamdi, Monia
    Zaied, Mourad
    2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 206 - 209
  • [27] Dynamic Caching Content Replacement in Base Station Assisted Wireless D2D Caching Networks
    Lee, Ming-Chun
    Feng, Hao
    Molisch, Andreas F.
    IEEE ACCESS, 2020, 8 : 33909 - 33925
  • [28] Optimal Caching Placement in D2D Networks
    Chedia, Jarray
    Belgacem, Chibani
    2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 187 - 191
  • [29] Learning for Matching Game in Cooperative D2D Communication With Incomplete Information
    Yuan, Yiling
    Yang, Tao
    Feng, Hui
    Hu, Bo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (07) : 7174 - 7178
  • [30] Learning Automata Based Q-Learning for Content Placement in Cooperative Caching
    Yang, Zhong
    Liu, Yuanwei
    Chen, Yue
    Jiao, Lei
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (06) : 3667 - 3680