Attention-Weighted Federated Deep Reinforcement Learning for Device-to-Device Assisted Heterogeneous Collaborative Edge Caching

被引：107

作者：

Wang, Xiaofei ^{[1
]}

Li, Ruibin ^{[1
]}

Wang, Chenyang ^{[1
]}

Li, Xiuhua ^{[2
,3
]}

Taleb, Tarik ^{[4
,5
,6
]}

Leung, Victor C. M. ^{[7
,8
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China

[2] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing, Peoples R China

[3] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China

[4] Aalto Univ, Sch Elect Engn, Dept Commun & Networking, Espoo 02150, Finland

[5] Oulu Univ, Dept Informat Technol & Elect Engn, Oulu 90570, Finland

[6] Sejong Univ, Dept Comp & Informat Secur, Seoul 05006, South Korea

[7] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

[8] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada

来源：

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS | 2021年 / 39卷 / 01期

基金：

加拿大自然科学与工程研究理事会; 芬兰科学院;

关键词：

Device-to-device communication; Data models; Collaboration; Servers; Delays; Computational modeling; Training; Edge caching; device to device; attention-weighted federated learning; deep reinforcement learning; INTERNET;

D O I：

10.1109/JSAC.2020.3036946

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In order to meet the growing demands for multimedia service access and release the pressure of the core network, edge caching and device-to-device (D2D) communication have been regarded as two promising techniques in next generation mobile networks and beyond. However, most existing related studies lack consideration of effective cooperation and adaptability to the dynamic network environments. In this article, based on the flexible trilateral cooperation among user equipment, edge base stations and a cloud server, we propose a D2D-assisted heterogeneous collaborative edge caching framework by jointly optimizing the node selection and cache replacement in mobile networks. We formulate the joint optimization problem as a Markov decision process, and use a deep Q-learning network to solve the long-term mixed integer linear programming problem. We further design an attention-weighted federated deep reinforcement learning (AWFDRL) model that uses federated learning to improve the training efficiency of the Q-learning network by considering the limited computing and storage capacity, and incorporates an attention mechanism to optimize the aggregation weights to avoid the imbalance of local model quality. We prove the convergence of the corresponding algorithm, and present simulation results to show the effectiveness of the proposed AWFDRL framework in reducing average delay of content access, improving hit rate and offloading traffic.

引用

页码：154 / 169

页数：16

共 42 条

[1]

[Anonymous], 2004, ALGORITHM DESIGN

[2]

[Anonymous], 2000, CARN MELL U DAT PRIV

[3]

[Anonymous], 1999, ACM SIGCOMM COMP COM

[4]

Arlitt M, 2000, PERF E R SI, V27, P3, DOI [10.1145/346000.346003, 10.1145/362883.362920]

[5]

Bahdanau D., 2014, ABS14090473 CORR

[6] 3D Point Cloud Retrieval With Bidirectional Feature Match [J].

Bold, Naranchimeg ;

Zhang, Chao ;

Akashi, Takuya .

IEEE ACCESS, 2019, 7 :164194-164202

[7] Cooperative cache-based data access in ad hoc networks [J].

Cao, GH ;

Yin, LZ ;

Das, CR .

COMPUTER, 2004, 37 (02) :32-+

[8] Communication-Efficient Federated Deep Learning With Layerwise Asynchronous Model Update and Temporally Weighted Aggregation [J].

Chen, Yang ;

Sun, Xiaoyan ;

Jin, Yaochu .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (10) :4229-4238

[9] Edge Intelligence: The Confluence of Edge Computing and Artificial Intelligence [J].

Deng, Shuiguang ;

Zhao, Hailiang ;

Fang, Weijia ;

Yin, Jianwei ;

Dustdar, Schahram ;

Zomaya, Albert Y. .

IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (08) :7457-7469

[10]

Govier T., 1997, MCGILL QUEENS PRESS

← 1 2 3 4 5 →