Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks

Cited by: 16

Authors:

Jiang, Wei ^{[1]}

Feng, Daquan ^{[1]}

Sun, Yao ^{[2]}

Feng, Gang ^{[3,4]}

Wang, Zhenzhong ^{[5]}

Xia, Xiang-Gen ^{[6]}

Affiliations:

[1] Shenzhen Univ, Guangdong Prov Engn Lab Digital Creat Technol, Shenzhen Key Lab Digital Creat Technol, Coll Elect & Informat Engn,Guangdong Key Lab Inte, Shenzhen 518060, Peoples R China

[2] Univ Glasgow, James Watt Sch Engn, Glasgow G12 8QQ, Lanark, Scotland

[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Peoples R China

[4] Univ Elect Sci & Technol China, Natl Key Lab Sci & Technol Commun, Chengdu 611731, Peoples R China

[5] Tech Management Ctr, China Media Grp, Beijing 100020, Peoples R China

[6] Univ Delaware, Dept Elect & Comp Engn, Newark, DE 19716 USA

Source:

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING | 2022 / Vol. 8 / No. 2

Funding:

National Key Research and Development Program of China;

Keywords:

Actor-critic algorithm; branching neural network; reinforcement learning; mobile edge caching; 5G NETWORKS; SMALL-CELL; DELIVERY; POLICY;

DOI:

10.1109/TCCN.2021.3130995

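Chinese Library Classification number:

TN [Electronic technology, communication technology];

Discipline classification code:

0809 ;

Abstract: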

Mobile edge caching/computing (MEC) has emerged as a promising approach for addressing the drastically increasing mobile data traffic by bringing high caching and computing capabilities to the edge of networks. Under the MEC architecture, content providers (CPs) are allowed to lease virtual machines (VMs) at MEC servers to proactively cache popular contents and thereby improve users' quality of experience. This scalable cache resource model raises the challenge of determining the ideal number of leased VMs for CPs, so as to obtain the minimum expected downloading delay of users at the lowest caching cost. To address this challenge, in this paper, we propose an actor-critic (AC) reinforcement learning based proactive caching policy for mobile edge networks that requires no prior knowledge of users' content demand. Specifically, we formulate the proactive caching problem under dynamic user content demand as a Markov decision process and propose an AC based caching algorithm to minimize the caching cost and the expected downloading delay. In particular, to reduce the computational complexity, a branching neural network is employed to approximate the policy function in the actor part. Numerical results show that the proposed caching algorithm can significantly reduce the total cost and the average downloading delay when compared with other popular algorithms.
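The abstract does not describe the network architecture, so the following is only a generic sketch of the action-branching idea it mentions, not the authors' implementation. It assumes a caching action made of several independent sub-decisions (for example, one per content item or VM slot), a single shared linear layer, and one softmax head per sub-decision. The class name `BranchingPolicy`, the layer sizes, and the helper functions are all hypothetical. The point illustrated is that the number of policy outputs grows with the sum of the branch sizes rather than their product.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


class BranchingPolicy:
    """Toy branching actor: a shared layer feeding one softmax head per action dimension.

    Hypothetical illustration; the linear layers and sizes are assumptions,
    not taken from the paper.
    """

    def __init__(self, state_dim, hidden_dim, branch_sizes, seed=0):
        self.rng = random.Random(seed)
        self.shared = self._init(hidden_dim, state_dim)
        # One weight matrix (head) per action dimension.
        self.heads = [self._init(n, hidden_dim) for n in branch_sizes]

    def _init(self, rows, cols):
        return [[self.rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    @staticmethod
    def _matvec(weights, vec):
        return [sum(w * v for w, v in zip(row, vec)) for row in weights]

    def action_probs(self, state):
        """Return one probability distribution per branch."""
        hidden = [math.tanh(h) for h in self._matvec(self.shared, state)]
        return [softmax(self._matvec(head, hidden)) for head in self.heads]

    def sample(self, state):
        """Sample each sub-action independently from its own head."""
        return [
            self.rng.choices(range(len(p)), weights=p)[0]
            for p in self.action_probs(state)
        ]


def joint_output_count(branch_sizes):
    """Outputs needed if every joint action gets its own output unit."""
    return math.prod(branch_sizes)


def branching_output_count(branch_sizes):
    """Outputs needed when each action dimension has its own head."""
    return sum(branch_sizes)
```

With four sub-decisions of three choices each, a flat policy would need 3^4 = 81 outputs, whereas the branching form needs 4 x 3 = 12, which is the kind of complexity reduction the abstract refers to.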

Pages: 1239-1252

Number of pages: 14

