Multi-Agent Federated Q-Learning Algorithms for Wireless Edge Caching

被引：0

作者：

Liu, Zhikai ^{[1
]}

Garg, Navneet ^{[2
,3
]}

Ratnarajah, Tharmalingam ^{[1
]}

机构：

[1] Univ Edinburgh, Inst Imaging Data & Commun, Sch Engn, Edinburgh EH8 9YL, Scotland

[2] Univ Edinburgh, Inst Data Image & Commun, Sch Engn, Edinburgh EH8 9YL, Scotland

[3] LNM Inst Informat Technol, Dept Elect & Commun Engn, Jaipur 302004, India

来源：

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY | 2025年 / 74卷 / 02期

基金：

英国工程与自然科学研究理事会;

关键词：

Libraries; Tensors; Heuristic algorithms; Delays; Vehicle dynamics; Computational modeling; Vectors; Real-time systems; Matrix decomposition; Communication system security; Large file library; linear function approximation; massive multiple-input multiple-output; tensor completion; wireless edge caching; NETWORKS; INTERNET;

D O I：

10.1109/TVT.2024.3473738

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Edge caching is an increasingly vital technique in wireless networks, particularly needed to address users' repeated demands, including real-time traffic data and map access in vehicular communications. This paper presents a three-tier system for edge caching, integrating massive multiple-input multiple-output (mMIMO) networks. We consider a scenario with an extensive file library whose size is larger than the aggregated caching capacity of the small base stations (SBSs). Each SBS proactively caches files based on dynamic file popularity that is unknown in advance. A distinguishing aspect of our model is its file-wise approach to the cache instead of the conventional vector-wise methods based on aggregate popularity. This approach introduces additional computational challenges concerning the extensive file library. We formulated the optimization problem to maximize the long-term discounted cache hit rate while minimizing the delivery delay. For the solution, two multi-agent federated Q-learning algorithms are proposed. The first algorithm employs selective updates of the Q-values of popular files to minimize computational overhead. The second algorithm incorporates linear function approximation (LFA) and tensor completion (TC) to streamline the updating process further, reducing the required parameter number. Through the real-world MovieLens dataset and compared with various baseline algorithms, simulations demonstrate that our proposed algorithm can reduce the delay by around 2.60--21.29%, improve the cache hit rate by around 5.71--66.42%, and reduce the computational complexity by at most 97.91%.

引用

页码：2973 / 2988

页数：16

共 67 条

[1] Randomized Algorithms for Computation of Tucker Decomposition and Higher Order SVD (HOSVD)
Ahmadi-Asl, Salman
Abukhovich, Stanislav
Asante-Mensah, Maame G.
Cichocki, Andrzej
Phan, Anh Huy
Tanaka, Tohishisa
Oseledets, Ivan
[J]. IEEE ACCESS, 2021, 9 : 28684 - 28706
[2] UAV Assisted Cooperative Caching on Network Edge Using Multi-Agent Actor-Critic Reinforcement Learning
Araf, Sadman
Saha, Adittya Soukarjya
Kazi, Sadia Hamid
Tran, Nguyen H. H.
Alam, Md. Golam Rabiul
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (02) : 2322 - 2337
[3] An Analysis on Caching Placement for Millimeter-Micro-Wave Hybrid Networks
Biswas, Sudip
Zhang, Tong
Singh, Keshav
Vuppala, Satyanarayana
Ratnarajah, Tharmalingam
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (02) : 1645 - 1662
[4] Carvalho D., 2020, Adv. Neural Inf. Process. Syst., V33, P19412
[5] Carvalho D. S., 2023, MULTIBELLMAN OPERATO
[6] Capacity of Cooperative Vehicular Networks With Infrastructure Support: Multiuser Case
Chen, Jieqiong
Mao, Guoqiang
Li, Changle
Liang, Weifa
Zhang, De-gan
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (02) : 1546 - 1560
[7] A novel offloading scheduling method for mobile application in mobile edge computing
Cui, Yu-ya
Zhang, De-gan
Zhang, Ting
Zhang, Jie
Piao, Mingjie
[J]. WIRELESS NETWORKS, 2022, 28 (06) : 2345 - 2363
[8] New method of vehicle cooperative communication based on fuzzy logic and signaling game strategy
De-Gan, Zhang
Chen-Hao, Ni
Jie, Zhang
Ting, Zhang
Zhi-Hao, Zhang
[J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 142 : 131 - 149
[9] Function Approximation Based Reinforcement Learning for Edge Caching in Massive MIMO Networks
Garg, Navneet
Sellathurai, Mathini
Bhatia, Vimal
Ratnarajah, Tharmalingam
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (04) : 2304 - 2316
[10] Online Content Popularity Prediction and Learning in Wireless Edge Caching
Garg, Navneet
Sellathurai, Mathini
Bhatia, Vimal
Bharath, B. N.
Ratnarajah, Tharmalingam
[J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (02) : 1087 - 1100

← 1 2 3 4 5 6 7 →