Recommendation-Enabled Edge Caching and D2D Offloading via Incentive-Driven Deep Reinforcement Learning

被引：0

作者：

Wu, Tong ^{[1
]}

Yu, Dongjin ^{[1
]}

Liu, Chengfei ^{[2
]}

Wang, Dongjing ^{[1
]}

Huang, Binbin ^{[1
]}

机构：

[1] Hangzhou Dianzi Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China

[2] Swinburne Univ Technol, Dept Comp Technol, Melbourne, Vic 3122, Australia

来源：

IEEE TRANSACTIONS ON SERVICES COMPUTING | 2024年 / 17卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Device-to-device communication; Costs; Prediction algorithms; Predictive models; Reinforcement learning; Sparse matrices; Quality of experience; Device-to-Device; edge caching; incentive mechanism; recommendation; reinforcement learning;

D O I：

10.1109/TSC.2024.3351219

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article proposes a novel architecture of Recommendation-Enabled Edge Caching and Device-to-Device (D2D) Offloading via Incentive-driven Deep Reinforcement Learning (DRL), which can not only solve the problem of inaccurate recommendation caused by sparse rating matrix, but also encourage users to participate in D2D offloading through an effective incentive mechanism. Specifically, we define Pseudo Markov Decision Process (PMDP) for the first time, which enables the conversion of the non-sequential process (e.g. rating prediction) into a sequential one, making it suitable for DRL. Then, combining Supervised Learning (SL) and DRL, a Supervised DRL for Collaborative Filtering (CF) algorithm, named SDRLCF, is proposed to predict missing ratings. After that, from the perspective of Content Service Center (CSC), the incentive-driven recommendation-enabled edge caching and D2D offloading can be formulated as a Non-Linear Integer Programming (NLIP) problem, which belongs to NP-hard, and is difficult to obtain the optimal solution in polynomial time. To address this issue, a DRL based Edge Caching and Recommendation algorithm, named DRLECR, is proposed to minimize the cost of CSC. Finally, combining with economic theory, a Reverse Auction based Payment Determination algorithm under Vickrey-Clarke-Groves (VCG) scheme, named RAPD, is proposed, which can stimulate users to participate in edge caching and D2D offloading while guaranteeing the individual rationality and truthfulness of participants. Extensive experiment results on both realistic and synthetic datasets demonstrate that the proposed algorithms outperform other baseline methods under different scenarios.

引用

页码：1724 / 1738

页数：15

共 50 条

[31] Task graph offloading via deep reinforcement learning in mobile edge computing
Liu, Jiagang
Mi, Yun
Zhang, Xinyu
Li, Xiaocui
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 158 : 545 - 555
[32] Blockchain and digital twin empowered edge caching for D2D wireless networks
Du, Jianbo
Yu, Zuting
Li, Shulei
Hu, Bintao
Gao, Yuan
Chu, Xiaoli
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 166
[33] CoPace: Edge Computation Offloading and Caching for Self-Driving With Deep Reinforcement Learning
Tian, Hao
Xu, Xiaolong
Qi, Lianyong
Zhang, Xuyun
Dou, Wanchun
Yu, Shui
Ni, Qiang
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) : 13281 - 13293
[34] Distributed Video Content Caching Policy With Deep Learning Approaches for D2D Communication
Liu, Zhikai
Song, Hui
Pan, Daru
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 15644 - 15655
[35] A Deep Reinforcement Learning-Based Transcoder Selection Framework for Blockchain-Enabled Wireless D2D Transcoding
Liu, Mengting
Teng, Yinglei
Yu, F. Richard
Leung, Victor C. M.
Song, Mei
IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (06) : 3426 - 3439
[36] D2D Resource Allocation Based on Reinforcement Learning and QoS
Kuo, Fang-Chang
Wang, Hwang-Cheng
Tseng, Chih-Cheng
Wu, Jung-Shyr
Xu, Jia-Hao
Chang, Jieh-Ren
MOBILE NETWORKS & APPLICATIONS, 2023, 28 (03) : 1076 - 1095
[37] Joint Caching and Recommendation Optimization From Network and User Perspectives in Wireless D2D Networks
Yang, Ming-Hsueh
Lee, Ming-Chun
Hong, Y. -W. Peter
IEEE TRANSACTIONS ON COMMUNICATIONS, 2025, 73 (02) : 1233 - 1247
[38] D2D Resource Allocation Based on Reinforcement Learning and QoS
Fang-Chang Kuo
Hwang-Cheng Wang
Chih-Cheng Tseng
Jung-Shyr Wu
Jia-Hao Xu
Jieh-Ren Chang
Mobile Networks and Applications, 2023, 28 : 1076 - 1095
[39] Load balancing in D2D networks Using Reinforcement Learning
Barros, Pedro H.
Cardoso-Pereira, Isadora
Foschini, Luca
Corradi, Antonio
Ramos, Heitor S.
2019 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2019, : 316 - 321
[40] Make Smart Decisions Faster: Deciding D2D Resource Allocation via Stackelberg Game Guided Multi-Agent Deep Reinforcement Learning
Shi, Dian
Li, Liang
Ohtsuki, Tomoaki
Pan, Miao
Han, Zhu
Poor, H. Vincent
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (12) : 4426 - 4438

← 1 2 3 4 5 →