Recommendation-Enabled Edge Caching and D2D Offloading via Incentive-Driven Deep Reinforcement Learning

Cited by: 0

Authors:

Wu, Tong [1]

Yu, Dongjin [1]

Liu, Chengfei [2]

Wang, Dongjing [1]

Huang, Binbin [1]

Affiliations:

[1] Hangzhou Dianzi Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China

[2] Swinburne Univ Technol, Dept Comp Technol, Melbourne, Vic 3122, Australia

Source:

IEEE TRANSACTIONS ON SERVICES COMPUTING | 2024, Vol. 17, No. 4

Funding:

National Natural Science Foundation of China;

Keywords:

Device-to-device communication; Costs; Prediction algorithms; Predictive models; Reinforcement learning; Sparse matrices; Quality of experience; Device-to-Device; edge caching; incentive mechanism; recommendation; reinforcement learning;

DOI:

10.1109/TSC.2024.3351219

Chinese Library Classification number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This article proposes a novel architecture for Recommendation-Enabled Edge Caching and Device-to-Device (D2D) Offloading via Incentive-driven Deep Reinforcement Learning (DRL). The architecture addresses the inaccurate recommendations caused by a sparse rating matrix and encourages users to participate in D2D offloading through an effective incentive mechanism. Specifically, we define the Pseudo Markov Decision Process (PMDP) for the first time, which converts a non-sequential process (e.g., rating prediction) into a sequential one, making it suitable for DRL. Then, combining Supervised Learning (SL) and DRL, we propose a Supervised DRL for Collaborative Filtering (CF) algorithm, named SDRLCF, to predict missing ratings. Next, from the perspective of the Content Service Center (CSC), incentive-driven recommendation-enabled edge caching and D2D offloading is formulated as a Non-Linear Integer Programming (NLIP) problem, which is NP-hard, so an optimal solution is difficult to obtain in polynomial time. To address this issue, a DRL-based Edge Caching and Recommendation algorithm, named DRLECR, is proposed to minimize the cost of the CSC. Finally, drawing on economic theory, a Reverse Auction based Payment Determination algorithm under the Vickrey-Clarke-Groves (VCG) scheme, named RAPD, is proposed; it stimulates users to participate in edge caching and D2D offloading while guaranteeing the individual rationality and truthfulness of participants. Extensive experimental results on both realistic and synthetic datasets demonstrate that the proposed algorithms outperform the baseline methods under different scenarios.
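As a rough illustration of the VCG reverse-auction idea mentioned in the abstract, the sketch below shows a textbook VCG procurement auction in Python. It is not the paper's RAPD algorithm, whose details are not given in this record. The function name `vcg_reverse_auction`, the user labels, and the bid values are hypothetical, and the sketch assumes that every bidder offers one identical unit and that the buyer needs exactly `k` units. Under those assumptions the VCG payment to each winner equals the (k+1)-th lowest bid.

```python
from typing import Dict, List, Tuple


def vcg_reverse_auction(
    bids: Dict[str, float], k: int
) -> Tuple[List[str], Dict[str, float]]:
    """Pick the k cheapest bidders and compute their VCG payments.

    Assumption (not from the paper): each bidder sells one identical unit,
    so every winner is paid the (k+1)-th lowest bid.
    """
    if not 0 < k < len(bids):
        raise ValueError("need 0 < k < number of bidders")

    # Rank by asking price; break ties by name so the result is deterministic.
    ranked = sorted(bids.items(), key=lambda item: (item[1], item[0]))

    winners = [name for name, _ in ranked[:k]]

    # The first losing bid is the critical price: a winner who asked for
    # more than this would have lost, so reporting the true cost is optimal.
    critical_price = ranked[k][1]
    payments = {name: critical_price for name in winners}
    return winners, payments


# Hypothetical asking prices from four users willing to cache and share content.
bids = {"u1": 3.0, "u2": 5.0, "u3": 2.0, "u4": 7.0}
winners, payments = vcg_reverse_auction(bids, k=2)
```

In this toy run the two cheapest users win and each is paid the third-lowest bid. Every payment is at least the winner's own asking price, which is the individual rationality property, and no winner can raise the payment by misreporting, which is the truthfulness property.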

Pages: 1724-1738

Number of pages: 15
