Caching for Doubly Selective Fading Channels via Model-Agnostic Meta-Reinforcement Learning

被引:0
作者
He, Weibao [1 ]
Zhou, Fasheng [2 ]
Tang, Dong [2 ]
Fang, Fang [3 ,4 ]
Chen, Wei [5 ]
机构
[1] Guangzhou Univ, Sch Phys & Mat Sci, Guangzhou 510006, Peoples R China
[2] Guangzhou Univ, Sch Elect & Commun Engn, Guangzhou 510006, Peoples R China
[3] Western Univ, Dept Elect & Comp Engn, London, ON N6A 3K7, Canada
[4] Western Univ, Dept Comp Sci, London, ON N6A 3K7, Canada
[5] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
来源
IEEE SYSTEMS JOURNAL | 2024年 / 18卷 / 03期
基金
中国国家自然科学基金;
关键词
Doubly-selective fading; meta-reinforcement learning; mobile edge caching; transmission delay; wideband communications; CONTENT PLACEMENT; OFDM SYSTEMS; WIRELESS; PERFORMANCE; EDGE; STRATEGIES; DELIVERY;
D O I
10.1109/JSYST.2024.3442958
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge caching is expected to alleviate the traffic consumption in next-generation communications. In this article, we consider the transmission delay in wideband communications deteriorated by rapid user movements, where the frequency-selective wideband fading channels become fast time-varying and hence doubly-selective due to the user movements. To preferably allocate the caching resource in such circumstance, we introduce a coordinated caching network and accordingly formulate an allocation problem. However, the formulated problem is shown to be NP-hard. By considering the extremely high computational complexity to solve the NP-hard problem by traditional optimization algorithm, and considering only a few samples can be obtained for each training instance due to shortened coherence-time in the dynamical doubly selective fading channels, we propose a model-agnostic meta-reinforcement learning method to address the formulated problem. Particularly, the proposed method can efficiently recognize the unstable mobile channels and accordingly cache to reduce the overall transmission delay while only requires a few training samples. Numerical simulations are performed to verify the effectiveness of the proposed method and results show that the proposed one outperforms the commonly adopted existing method of deep-deterministic-policy-gradient learning in terms of average delay and cache hit rate.
引用
收藏
页码:1776 / 1785
页数:10
相关论文
共 44 条
[1]   Mobility Modeling and Stochastic Property Analysis of Airborne Network [J].
Anjum, M. D. Nashid ;
Wang, Honggang .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (03) :1282-1294
[2]  
[Anonymous], 2005, Wireless Communications
[3]   Winding Function Based Analytical Model of Squirrel Cage Induction Motor for Fault Diagnostics [J].
Asad, Bilal ;
Vaimann, Toomas ;
Kallaste, Ants ;
Rassolkin, Anton ;
Belahcen, Anouar .
2019 26TH INTERNATIONAL WORKSHOP ON ELECTRIC DRIVES: IMPROVEMENT IN EFFICIENCY OF ELECTRIC DRIVES (IWED) PROCEEDINGS, 2019,
[4]   THE THEORY OF DYNAMIC PROGRAMMING [J].
BELLMAN, R .
BULLETIN OF THE AMERICAN MATHEMATICAL SOCIETY, 1954, 60 (06) :503-515
[5]   A polynomial time approximation scheme for the multiple knapsack problem [J].
Chekuri, C ;
Khanna, S .
SIAM JOURNAL ON COMPUTING, 2006, 35 (03) :713-728
[6]   Cache-Enabled Multicast Content Pushing With Structured Deep Learning [J].
Chen, Qi ;
Wang, Wei ;
Chen, Wei ;
Yu, F. Richard ;
Zhang, Zhaoyang .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (07) :2135-2149
[7]   Cache-Assisted Collaborative Task Offloading and Resource Allocation Strategy: A Metareinforcement Learning Approach [J].
Chen, Shiyou ;
Rui, Lanlan ;
Gao, Zhipeng ;
Li, Wenjing ;
Qiu, Xuesong .
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (20) :19823-19842
[8]   Caching in Narrow-Band Burst-Error Channels via Meta Self-Supervision Learning [J].
Chen, Ximing ;
Zhou, Fasheng ;
Tang, Dong .
IEEE COMMUNICATIONS LETTERS, 2022, 26 (12) :2929-2933
[9]   Hierarchical Cooperative Caching in Fog Radio Access Networks: A Brain Storm Optimization Approach [J].
Chen, Xuan ;
Jiang, Yanxiang ;
Fan, Baotian ;
Zheng, Fu-Chun ;
Niyato, Dusit ;
You, Xiaohu .
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[10]   Multi-Agent Reinforcement Learning Based 3D Trajectory Design in Aerial-Terrestrial Wireless Caching Networks [J].
Chen, Yu-Jia ;
Liao, Kai-Min ;
Ku, Meng-Lin ;
Tso, Fung Po ;
Chen, Guan-Yi .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (08) :8201-8215