Recommendation-Enabled Edge Caching and D2D Offloading via Incentive-Driven Deep Reinforcement Learning

被引:0
|
作者
Wu, Tong [1 ]
Yu, Dongjin [1 ]
Liu, Chengfei [2 ]
Wang, Dongjing [1 ]
Huang, Binbin [1 ]
机构
[1] Hangzhou Dianzi Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Swinburne Univ Technol, Dept Comp Technol, Melbourne, Vic 3122, Australia
基金
中国国家自然科学基金;
关键词
Device-to-device communication; Costs; Prediction algorithms; Predictive models; Reinforcement learning; Sparse matrices; Quality of experience; Device-to-Device; edge caching; incentive mechanism; recommendation; reinforcement learning;
D O I
10.1109/TSC.2024.3351219
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes a novel architecture of Recommendation-Enabled Edge Caching and Device-to-Device (D2D) Offloading via Incentive-driven Deep Reinforcement Learning (DRL), which can not only solve the problem of inaccurate recommendation caused by sparse rating matrix, but also encourage users to participate in D2D offloading through an effective incentive mechanism. Specifically, we define Pseudo Markov Decision Process (PMDP) for the first time, which enables the conversion of the non-sequential process (e.g. rating prediction) into a sequential one, making it suitable for DRL. Then, combining Supervised Learning (SL) and DRL, a Supervised DRL for Collaborative Filtering (CF) algorithm, named SDRLCF, is proposed to predict missing ratings. After that, from the perspective of Content Service Center (CSC), the incentive-driven recommendation-enabled edge caching and D2D offloading can be formulated as a Non-Linear Integer Programming (NLIP) problem, which belongs to NP-hard, and is difficult to obtain the optimal solution in polynomial time. To address this issue, a DRL based Edge Caching and Recommendation algorithm, named DRLECR, is proposed to minimize the cost of CSC. Finally, combining with economic theory, a Reverse Auction based Payment Determination algorithm under Vickrey-Clarke-Groves (VCG) scheme, named RAPD, is proposed, which can stimulate users to participate in edge caching and D2D offloading while guaranteeing the individual rationality and truthfulness of participants. Extensive experiment results on both realistic and synthetic datasets demonstrate that the proposed algorithms outperform other baseline methods under different scenarios.
引用
收藏
页码:1724 / 1738
页数:15
相关论文
共 50 条
  • [31] Task graph offloading via deep reinforcement learning in mobile edge computing
    Liu, Jiagang
    Mi, Yun
    Zhang, Xinyu
    Li, Xiaocui
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 158 : 545 - 555
  • [32] Blockchain and digital twin empowered edge caching for D2D wireless networks
    Du, Jianbo
    Yu, Zuting
    Li, Shulei
    Hu, Bintao
    Gao, Yuan
    Chu, Xiaoli
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
  • [33] CoPace: Edge Computation Offloading and Caching for Self-Driving With Deep Reinforcement Learning
    Tian, Hao
    Xu, Xiaolong
    Qi, Lianyong
    Zhang, Xuyun
    Dou, Wanchun
    Yu, Shui
    Ni, Qiang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) : 13281 - 13293
  • [34] Distributed Video Content Caching Policy With Deep Learning Approaches for D2D Communication
    Liu, Zhikai
    Song, Hui
    Pan, Daru
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 15644 - 15655
  • [35] A Deep Reinforcement Learning-Based Transcoder Selection Framework for Blockchain-Enabled Wireless D2D Transcoding
    Liu, Mengting
    Teng, Yinglei
    Yu, F. Richard
    Leung, Victor C. M.
    Song, Mei
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (06) : 3426 - 3439
  • [36] D2D Resource Allocation Based on Reinforcement Learning and QoS
    Kuo, Fang-Chang
    Wang, Hwang-Cheng
    Tseng, Chih-Cheng
    Wu, Jung-Shyr
    Xu, Jia-Hao
    Chang, Jieh-Ren
    MOBILE NETWORKS & APPLICATIONS, 2023, 28 (03) : 1076 - 1095
  • [37] Joint Caching and Recommendation Optimization From Network and User Perspectives in Wireless D2D Networks
    Yang, Ming-Hsueh
    Lee, Ming-Chun
    Hong, Y. -W. Peter
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2025, 73 (02) : 1233 - 1247
  • [38] D2D Resource Allocation Based on Reinforcement Learning and QoS
    Fang-Chang Kuo
    Hwang-Cheng Wang
    Chih-Cheng Tseng
    Jung-Shyr Wu
    Jia-Hao Xu
    Jieh-Ren Chang
    Mobile Networks and Applications, 2023, 28 : 1076 - 1095
  • [39] Load balancing in D2D networks Using Reinforcement Learning
    Barros, Pedro H.
    Cardoso-Pereira, Isadora
    Foschini, Luca
    Corradi, Antonio
    Ramos, Heitor S.
    2019 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2019, : 316 - 321
  • [40] Make Smart Decisions Faster: Deciding D2D Resource Allocation via Stackelberg Game Guided Multi-Agent Deep Reinforcement Learning
    Shi, Dian
    Li, Liang
    Ohtsuki, Tomoaki
    Pan, Miao
    Han, Zhu
    Poor, H. Vincent
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (12) : 4426 - 4438