Reinforcement Learning Based Cooperative Coded Caching Under Dynamic Popularities in Ultra-Dense Networks

Cited by: 26
Authors
Gao, Shen [1, 2]
Dong, Peihao [1]
Pan, Zhiwen [1, 2]
Li, Geoffrey Ye [3]
Affiliations
[1] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
[2] Purple Mt Labs, Nanjing 211100, Peoples R China
[3] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Keywords
Ultra-dense network; reinforcement learning; cooperative coded caching; popularity dynamics; WIRELESS; DELIVERY; DESIGN; TRANSMISSION; MIMO;
DOI
10.1109/TVT.2020.2979918
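Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification codes
0808; 0809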
Abstract
For ultra-dense networks with wireless backhaul, the caching strategy at small base stations (SBSs), which usually have limited storage, is critical to meeting massive high-data-rate requests. Since the content popularity profile varies with time in an unknown way, we exploit reinforcement learning (RL) to design a cooperative caching strategy with maximum-distance separable (MDS) coding. We model the MDS-coding-based cooperative caching as a Markov decision process to capture the popularity dynamics and maximize the long-term expected cumulative traffic load served directly by the SBSs without accessing the macro base station. For the formulated problem, we first find the optimal solution for a small-scale system by embedding the cooperative MDS coding into Q-learning. To cope with the large-scale case, we approximate the state-action value function heuristically. The approximated function includes only a small number of learnable parameters and enables us to propose a fast and efficient action-selection approach, which dramatically reduces the complexity. Numerical results verify the optimality or near-optimality of the proposed RL-based algorithms and show their superiority over the baseline schemes. The algorithms also exhibit good robustness to different environments.
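The small-scale case in the abstract (Q-learning over MDS-coded cache placements while popularity drifts) can be illustrated with a toy sketch. This is not the authors' algorithm or system model: the two popularity profiles, the transition probability `STAY_PROB`, the number of cooperating SBSs, the cache size, the allowed fractions, and all function names are hypothetical choices made only so the example is small enough to run. The reward is the share of requests that the SBSs can serve without the macro base station, assuming a user can combine distinct coded segments from `NUM_COOP_SBS` caches.

```python
import itertools
import random

# All values below are illustrative assumptions, not parameters from the paper.
POPULARITY = [            # two hypothetical popularity profiles over 3 files
    (0.6, 0.3, 0.1),
    (0.1, 0.3, 0.6),
]
STAY_PROB = 0.95          # probability that the profile is unchanged next slot
NUM_COOP_SBS = 2          # SBSs a user can reach and combine MDS segments from
CACHE_SIZE = 1.0          # per-SBS storage, in units of one file
FRACTIONS = (0.0, 0.5, 1.0)

# An action is the fraction of each MDS-coded file stored at every SBS.
ACTIONS = [a for a in itertools.product(FRACTIONS, repeat=3)
           if sum(a) <= CACHE_SIZE]


def served_load(popularity, alloc):
    """Share of requests served by SBSs alone: a file is fully recovered once
    the segments gathered from NUM_COOP_SBS caches add up to one whole file."""
    return sum(p * min(1.0, NUM_COOP_SBS * frac)
               for p, frac in zip(popularity, alloc))


def train(steps=100_000, alpha=0.02, gamma=0.5, epsilon=0.2, seed=0):
    """Tabular Q-learning; the state is the index of the current profile."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in POPULARITY]
    state = 0
    for _ in range(steps):
        # Epsilon-greedy choice of the cache placement for the next slot.
        if rng.random() < epsilon:
            action = rng.randrange(len(ACTIONS))
        else:
            action = max(range(len(ACTIONS)), key=lambda a: q[state][a])
        # Popularity evolves; the agent only observes the outcome.
        next_state = state if rng.random() < STAY_PROB else 1 - state
        reward = served_load(POPULARITY[next_state], ACTIONS[action])
        # Standard Q-learning update toward the bootstrapped target.
        target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (target - q[state][action])
        state = next_state
    return q


def greedy_policy(q):
    """Best placement per profile according to the learned Q-table."""
    return [ACTIONS[max(range(len(ACTIONS)), key=lambda a: q[s][a])]
            for s in range(len(POPULARITY))]


if __name__ == "__main__":
    for s, alloc in enumerate(greedy_policy(train())):
        print(f"profile {s}: cache fractions {alloc}")
```

Under these assumed numbers, storing half of a coded file at each SBS is enough for two cooperating SBSs to deliver it, so one would expect the learned placement to split the cache between the two files most likely to be popular in the next slot. The Q-table here has only 2 x 10 entries; the abstract's point about the large-scale case is that such a table grows too quickly, which is why the authors turn to an approximated state-action value function.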
Pages: 5442-5456
Number of pages: 15