NA-Caching: An Adaptive Content Management Approach Based on Deep Reinforcement Learning

被引：6

作者：

Fan, Qilin ^{[1
,2
]}

Li, Xiuhua ^{[1
,2
]}

Wang, Sen ^{[1
,2
]}

Fu, Shu ^{[3
]}

Zhang, Xu ^{[4
]}

Wang, Yueyang ^{[1
,2
]}

机构：

[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China

[2] Chongqing Univ, Inst Intelligent Network & Edge Comp, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 401331, Peoples R China

[3] Chongqing Univ, Coll Commun Engn, Chongqing 401331, Peoples R China

[4] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Jiangsu, Peoples R China

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

中国国家自然科学基金;

关键词：

Servers; Streaming media; Internet; Adaptive systems; Quality of service; Reinforcement learning; Recurrent neural networks; Content management; deep reinforcement learning; quality of service; content delivery network; WEB; POPULARITY; ALGORITHMS;

D O I：

10.1109/ACCESS.2019.2947460

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video streaming is a dominant application over todays Internet. The current mainstream video streaming solution is to utilize the services of a Content Delivery Network (CDN) provider. By replicating video content closer to the network edge, caching provides an effective mechanism for alleviating the demand for massive bandwidth for the Internet backbone. It reduces the network traffic and capital expense for streaming the video content, and in the meantime, enhance Internets Quality of Service (QoS). In this paper, we propose a neural adaptive caching approach, named NA-Caching, for helping cache learn to make caching decisions from its own experiences rather than a specific mathematical model, in a way similar to how a human being learns a new skill (e.g. cycling, swimming). NA-Caching leverages the benefits of the Recurrent Neural Network (RNN) as well as the Deep Reinforcement Learning (DRL) to maximize the cache efficiency by jointly learning request features, caching space dynamics and making decisions. Specifically, we utilize Gated Recurrent Unit (GRU) to characterize the evolving features of the dynamic requests and caching space. Moreover, the above GRU-based representation network is integrated into a Deep Q-Network (DQN) framework for making adaptive caching decisions online. To evaluate the performance of the proposed approach, we conduct extensive experiments on anonymized real-world traces from a video provider. The results demonstrate that our algorithm significantly outperform several candidate methods.

引用

页码：152014 / 152022

页数：9

共 40 条

[1]

[Anonymous], 2014, ABS14091259 CORR

[2]

[Anonymous], 1973, OPERATING SYSTEMS TH

[3]

[Anonymous], 2013, ARXIV13014909

[4]

[Anonymous], 2015, P INT C LEARN REPR I

[5] Optimal Content Placement for a Large-Scale VoD System [J].

Applegate, David ;

Archer, Aaron ;

Gopalakrishnan, Vijay ;

Lee, Seungjoon ;

Ramakrishnan, K. K. .

IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (04) :2114-2127

[6] A STUDY OF REPLACEMENT ALGORITHMS FOR A VIRTUAL-STORAGE COMPUTER [J].

BELADY, LA .

IBM SYSTEMS JOURNAL, 1966, 5 (02) :78-&

[7]

Blasco P, 2014, IEEE ICC, P1897, DOI 10.1109/ICC.2014.6883600

[8]

Cao P, 1997, PROCEEDINGS OF THE USENIX SYMPOSIUM ON INTERNET TECHNOLOGIES AND SYSTEMS, P193

[9] Hierarchical web caching systems: Modeling, design and experimental results [J].

Che, H ;

Tung, Y ;

Wang, ZJ .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2002, 20 (07) :1305-1314

[10]

Cisco Systems Inc., 2017, Cisco Visual Networking Index: Forecast and Trends, 2017-2022 (white paper)

← 1 2 3 4 →