Multi-Agent Federated Q-Learning Algorithms for Wireless Edge Caching

被引:0
作者
Liu, Zhikai [1 ]
Garg, Navneet [2 ,3 ]
Ratnarajah, Tharmalingam [1 ]
机构
[1] Univ Edinburgh, Inst Imaging Data & Commun, Sch Engn, Edinburgh EH8 9YL, Scotland
[2] Univ Edinburgh, Inst Data Image & Commun, Sch Engn, Edinburgh EH8 9YL, Scotland
[3] LNM Inst Informat Technol, Dept Elect & Commun Engn, Jaipur 302004, India
基金
英国工程与自然科学研究理事会;
关键词
Libraries; Tensors; Heuristic algorithms; Delays; Vehicle dynamics; Computational modeling; Vectors; Real-time systems; Matrix decomposition; Communication system security; Large file library; linear function approximation; massive multiple-input multiple-output; tensor completion; wireless edge caching; NETWORKS; INTERNET;
D O I
10.1109/TVT.2024.3473738
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Edge caching is an increasingly vital technique in wireless networks, particularly needed to address users' repeated demands, including real-time traffic data and map access in vehicular communications. This paper presents a three-tier system for edge caching, integrating massive multiple-input multiple-output (mMIMO) networks. We consider a scenario with an extensive file library whose size is larger than the aggregated caching capacity of the small base stations (SBSs). Each SBS proactively caches files based on dynamic file popularity that is unknown in advance. A distinguishing aspect of our model is its file-wise approach to the cache instead of the conventional vector-wise methods based on aggregate popularity. This approach introduces additional computational challenges concerning the extensive file library. We formulated the optimization problem to maximize the long-term discounted cache hit rate while minimizing the delivery delay. For the solution, two multi-agent federated Q-learning algorithms are proposed. The first algorithm employs selective updates of the Q-values of popular files to minimize computational overhead. The second algorithm incorporates linear function approximation (LFA) and tensor completion (TC) to streamline the updating process further, reducing the required parameter number. Through the real-world MovieLens dataset and compared with various baseline algorithms, simulations demonstrate that our proposed algorithm can reduce the delay by around 2.60--21.29%, improve the cache hit rate by around 5.71--66.42%, and reduce the computational complexity by at most 97.91%.
引用
收藏
页码:2973 / 2988
页数:16
相关论文
共 67 条
  • [1] Randomized Algorithms for Computation of Tucker Decomposition and Higher Order SVD (HOSVD)
    Ahmadi-Asl, Salman
    Abukhovich, Stanislav
    Asante-Mensah, Maame G.
    Cichocki, Andrzej
    Phan, Anh Huy
    Tanaka, Tohishisa
    Oseledets, Ivan
    [J]. IEEE ACCESS, 2021, 9 : 28684 - 28706
  • [2] UAV Assisted Cooperative Caching on Network Edge Using Multi-Agent Actor-Critic Reinforcement Learning
    Araf, Sadman
    Saha, Adittya Soukarjya
    Kazi, Sadia Hamid
    Tran, Nguyen H. H.
    Alam, Md. Golam Rabiul
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (02) : 2322 - 2337
  • [3] An Analysis on Caching Placement for Millimeter-Micro-Wave Hybrid Networks
    Biswas, Sudip
    Zhang, Tong
    Singh, Keshav
    Vuppala, Satyanarayana
    Ratnarajah, Tharmalingam
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2019, 67 (02) : 1645 - 1662
  • [4] Carvalho D., 2020, Adv. Neural Inf. Process. Syst., V33, P19412
  • [5] Carvalho D. S., 2023, MULTIBELLMAN OPERATO
  • [6] Capacity of Cooperative Vehicular Networks With Infrastructure Support: Multiuser Case
    Chen, Jieqiong
    Mao, Guoqiang
    Li, Changle
    Liang, Weifa
    Zhang, De-gan
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (02) : 1546 - 1560
  • [7] A novel offloading scheduling method for mobile application in mobile edge computing
    Cui, Yu-ya
    Zhang, De-gan
    Zhang, Ting
    Zhang, Jie
    Piao, Mingjie
    [J]. WIRELESS NETWORKS, 2022, 28 (06) : 2345 - 2363
  • [8] New method of vehicle cooperative communication based on fuzzy logic and signaling game strategy
    De-Gan, Zhang
    Chen-Hao, Ni
    Jie, Zhang
    Ting, Zhang
    Zhi-Hao, Zhang
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 142 : 131 - 149
  • [9] Function Approximation Based Reinforcement Learning for Edge Caching in Massive MIMO Networks
    Garg, Navneet
    Sellathurai, Mathini
    Bhatia, Vimal
    Ratnarajah, Tharmalingam
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (04) : 2304 - 2316
  • [10] Online Content Popularity Prediction and Learning in Wireless Edge Caching
    Garg, Navneet
    Sellathurai, Mathini
    Bhatia, Vimal
    Bharath, B. N.
    Ratnarajah, Tharmalingam
    [J]. IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (02) : 1087 - 1100