Recommendation-Enabled Edge Caching and D2D Offloading via Incentive-Driven Deep Reinforcement Learning

Citations: 0
Authors
Wu, Tong [1]
Yu, Dongjin [1]
Liu, Chengfei [2]
Wang, Dongjing [1]
Huang, Binbin [1]
Affiliations
[1] Hangzhou Dianzi Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Swinburne Univ Technol, Dept Comp Technol, Melbourne, Vic 3122, Australia
Funding
National Natural Science Foundation of China
Keywords
Device-to-device communication; Costs; Prediction algorithms; Predictive models; Reinforcement learning; Sparse matrices; Quality of experience; Device-to-Device; edge caching; incentive mechanism; recommendation; reinforcement learning
DOI
10.1109/TSC.2024.3351219
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This article proposes a novel architecture for Recommendation-Enabled Edge Caching and Device-to-Device (D2D) Offloading via Incentive-driven Deep Reinforcement Learning (DRL). The architecture addresses inaccurate recommendations caused by a sparse rating matrix and encourages users to participate in D2D offloading through an effective incentive mechanism. Specifically, we define the Pseudo Markov Decision Process (PMDP) for the first time, which converts a non-sequential process (e.g., rating prediction) into a sequential one, making it suitable for DRL. Then, combining Supervised Learning (SL) and DRL, we propose a Supervised DRL for Collaborative Filtering (CF) algorithm, named SDRLCF, to predict missing ratings. Next, from the perspective of the Content Service Center (CSC), incentive-driven recommendation-enabled edge caching and D2D offloading is formulated as a Non-Linear Integer Programming (NLIP) problem. This problem is NP-hard, so an optimal solution is difficult to obtain in polynomial time. To address this issue, we propose a DRL-based Edge Caching and Recommendation algorithm, named DRLECR, to minimize the cost of the CSC. Finally, drawing on economic theory, we propose a Reverse Auction based Payment Determination algorithm under the Vickrey-Clarke-Groves (VCG) scheme, named RAPD, which motivates users to participate in edge caching and D2D offloading while guaranteeing the individual rationality and truthfulness of participants. Extensive experimental results on both realistic and synthetic datasets demonstrate that the proposed algorithms outperform the baseline methods under different scenarios.
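The abstract does not spell out RAPD itself, so the following is only a minimal sketch of the textbook VCG payment rule in a reverse auction, not the authors' algorithm. It assumes a simplified setting that is not taken from the paper: each user offers exactly one unit (say, one cache slot), the buyer needs k units, and every name here (`vcg_reverse_auction`, `bids`, `k`) is hypothetical.

```python
from typing import Dict, List, Tuple


def vcg_reverse_auction(bids: Dict[str, float], k: int) -> Tuple[List[str], Dict[str, float]]:
    """Pick the k cheapest bidders and compute their VCG payments.

    Assumption (not from the paper): every bidder supplies one unit and
    the buyer needs k units. More than k bidders are required so that
    each winner's payment is well defined.
    """
    if k < 1 or len(bids) <= k:
        raise ValueError("need 1 <= k < number of bidders")

    # Sort by bid, breaking ties by name so the outcome is deterministic.
    ranked = sorted(bids.items(), key=lambda item: (item[1], item[0]))
    winners = [name for name, _ in ranked[:k]]
    total_cost = sum(bid for _, bid in ranked[:k])

    def min_cost_without(excluded: str) -> float:
        # Cheapest way to buy k units if `excluded` had not taken part.
        remaining = [bid for name, bid in ranked if name != excluded]
        return sum(remaining[:k])

    payments: Dict[str, float] = {}
    for name in winners:
        # Cost carried by the other winners in the chosen allocation.
        others_cost = total_cost - bids[name]
        # VCG payment: the externality this winner imposes on the rest.
        payments[name] = min_cost_without(name) - others_cost
    return winners, payments


if __name__ == "__main__":
    winners, payments = vcg_reverse_auction(
        {"u1": 3.0, "u2": 5.0, "u3": 4.0, "u4": 8.0}, k=2
    )
    print(winners, payments)
```

With bids of 3, 4, 5 and 8 and k = 2, the two lowest bidders win and each is paid 5, the lowest losing bid. No winner is paid less than its bid (individual rationality), and a winner's payment does not depend on its own bid, which is what makes truthful bidding a dominant strategy in this simplified setting.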
Pages: 1724-1738
Number of pages: 15