Content Caching Policy for 5G Network Based on Asynchronous Advantage Actor-Critic Method

Cited by: 1

Authors:

Shi, Zhuoyang ^{[1]}

Li, Lixin ^{[1]}

Xu, Yang ^{[1]}

Li, Xu ^{[1]}

Chen, Wei ^{[2]}

Han, Zhu ^{[3]}

Affiliations:

[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Peoples R China

[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

[3] Univ Houston, Dept Elect & Comp Engn, Houston, TX 77004 USA

Source:

2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM) | 2019

Funding:

China Postdoctoral Science Foundation;

Keywords:

Deep reinforcement learning; asynchronous advantage actor-critic; content caching; transmission cost;

DOI:

10.1109/globecom38437.2019.9014268

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Content caching at base stations (BSs) has attracted growing attention in 5G networks because it saves resources and reduces data traffic. In practice, however, designing an intelligent caching policy is challenging because storage capacity is limited and users' requests vary in time and space. In this paper, we propose an algorithm based on asynchronous advantage actor-critic (A3C) to solve the content caching problem. We consider a set of cooperative BSs, each equipped with a cache; every BS can fetch contents from either neighboring BSs or the backbone network, at different costs. To learn the optimal caching and sharing policy, the online A3C-based algorithm is designed to minimize the total transmission cost without knowing the content popularity distribution. To evaluate the proposed algorithm, we compare its performance with classical caching policies, including Least Recently Used (LRU), Least Frequently Used (LFU), and Adaptive Replacement Cache (ARC), and with one distributed algorithm from the literature. The simulation results show that the proposed A3C-based algorithm can achieve a low transmission cost and improve the convergence rate in a dynamic environment.
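The cost structure described in the abstract (local cache, neighboring BS, backbone network) can be illustrated with a minimal sketch of the LRU baseline. This is not the paper's A3C algorithm or its simulation setup: the cost constants, the `LRUCache` class, and the `serve` and `total_cost` helpers are hypothetical names and values chosen only for illustration.

```python
from collections import OrderedDict

# Hypothetical per-request costs (not from the paper): a local hit is free,
# and a neighboring BS is assumed cheaper than the backbone network.
COST_LOCAL, COST_NEIGHBOR, COST_BACKBONE = 0.0, 1.0, 5.0


class LRUCache:
    """Fixed-capacity cache that evicts the least recently used content."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = OrderedDict()

    def __contains__(self, content):
        return content in self.items

    def touch(self, content):
        """Mark content as most recently used, inserting and evicting if needed."""
        if self.capacity <= 0:
            return
        if content in self.items:
            self.items.move_to_end(content)
            return
        if len(self.items) >= self.capacity:
            self.items.popitem(last=False)  # drop the least recently used entry
        self.items[content] = True


def serve(request, bs, caches):
    """Return the cost of serving `request` at base station `bs`, then update its cache."""
    local = caches[bs]
    if request in local:
        cost = COST_LOCAL
    elif any(request in c for i, c in enumerate(caches) if i != bs):
        cost = COST_NEIGHBOR
    else:
        cost = COST_BACKBONE
    local.touch(request)
    return cost


def total_cost(trace, caches):
    """Sum the transmission cost over a trace of (bs_index, content_id) requests."""
    return sum(serve(content, bs, caches) for bs, content in trace)


caches = [LRUCache(2), LRUCache(2)]
trace = [(0, "a"), (1, "a"), (0, "a"), (0, "b"), (0, "c"), (0, "a")]
result = total_cost(trace, caches)
```

A learning-based policy such as the one proposed would replace the fixed LRU eviction rule in `touch` with a learned caching decision, while the total transmission cost would remain the quantity to minimize.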

References

Pages: 6

12 references in total

[1]

Bastug E, 2015, 2015 13th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt), P161, DOI 10.1109/WIOPT.2015.7151068

[2]

Gu JX, 2014, IEEE ICC, P2648, DOI 10.1109/ICC.2014.6883723

[3] Fundamental Limits of Caching [J].

Maddah-Ali, Mohammad Ali ;

Niesen, Urs .

IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (05) :2856-2867

[4]

Mnih V., 2016, 33rd International Conference on Machine Learning, ICML 2016, V4, P2850, DOI 10.48550/ARXIV.1602.01783

[5] The Role of Caching in Future Communication Systems and Networks [J].

Paschos, Georgios S. ;

Iosifidis, George ;

Tao, Meixia ;

Towsley, Don ;

Caire, Giuseppe .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2018, 36 (06) :1111-1125

[6]

Raigoza J, 2014, 2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), P131, DOI 10.1109/ICIS.2014.6912120

[7]

Rogova G, 2002, PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOL II, P1263, DOI 10.1109/ICIF.2002.1020958

[8] Optimal and Scalable Caching for 5G Using Reinforcement Learning of Space-Time Popularities [J].

Sadeghi, Alireza ;

Sheikholeslami, Fatemeh ;

Giannakis, Georgios B. .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (01) :180-190

[9] Learning-Based Content Caching and Sharing for Wireless Networks [J].

Song, Jiongjiong ;

Sheng, Min ;

Quek, Tony Q. S. ;

Xu, Chao ;

Wang, Xijun .

IEEE TRANSACTIONS ON COMMUNICATIONS, 2017, 65 (10) :4309-4324

[10]

Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
