Deep Q-Learning Based Computation Offloading Strategy for Mobile Edge Computing

被引：39

作者：

Wei, Yifei ^{[1
]}

Wang, Zhaoying ^{[1
]}

Guo, Da ^{[1
]}

Yu, F. Richard ^{[2
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing Key Lab Work Safety Intelligent Monitorin, Beijing 100876, Peoples R China

[2] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2019年 / 59卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Mobile edge computing; computation offloading; resource allocation; deep reinforcement learning; RESOURCE-ALLOCATION; NETWORKS;

D O I：

10.32604/cmc.2019.04836

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To reduce the transmission latency and mitigate the backhaul burden of the centralized cloud-based network services, the mobile edge computing (MEC) has been drawing increased attention from both industry and academia recently. This paper focuses on mobile users' computation offloading problem in wireless cellular networks with mobile edge computing for the purpose of optimizing the computation offloading decision making policy. Since wireless network states and computing requests have stochastic properties and the environment's dynamics are unknown, we use the model-free reinforcement learning (RL) framework to formulate and tackle the computation offloading problem. Each mobile user learns through interactions with the environment and the estimate of its performance in the form of value function, then it chooses the overhead-aware optimal computation offloading action (local computing or edge computing) based on its state. The state spaces are high-dimensional in our work and value function is unrealistic to estimate. Consequently, we use deep reinforcement learning algorithm, which combines RL method Q-learning with the deep neural network (DNN) to approximate the value functions for complicated control applications, and the optimal policy will be obtained when the value function reaches convergence. Simulation results showed that the effectiveness of the proposed method in comparison with baseline methods in terms of total overheads of all mobile users.

引用

页码：89 / 104

页数：16

共 22 条

[1] Markov Decision Processes With Applications in Wireless Sensor Networks: A Survey [J].

Abu Alsheikh, Mohammad ;

Dinh Thai Hoang ;

Niyato, Dusit ;

Tan, Hwee-Pink ;

Lin, Shaowei .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (03) :1239-1267

[2]

[Anonymous], COMPUTERS COMMUNICAT

[3] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

[4] Decentralized Computation Offloading Game for Mobile Cloud Computing [J].

Chen, Xu .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (04) :974-983

[5] A Dynamic Offloading Algorithm for Mobile Computing [J].

Huang, Dong ;

Wang, Ping ;

Niyato, Dusit .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2012, 11 (06) :1991-1995

[6]

Iosifidis G, 2013, 2013 11TH INTERNATIONAL SYMPOSIUM ON MODELING & OPTIMIZATION IN MOBILE, AD HOC & WIRELESS NETWORKS (WIOPT), P154

[7] Small cell backhaul: challenges and prospective solutions [J].

Jafari, Amir H. ;

Lopez-Perez, David ;

Song, Hui ;

Claussen, Holger ;

Ho, Lester ;

Zhang, Jie .

EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2015, :1-18

[8]

Khatana A., 2018, INT J COMPUTER SCI E, V6, P440

[9] A Survey of Computation Offloading for Mobile Systems [J].

Kumar, Karthik ;

Liu, Jibang ;

Lu, Yung-Hsiang ;

Bhargava, Bharat .

MOBILE NETWORKS & APPLICATIONS, 2013, 18 (01) :129-140

[10] A Survey on Mobile Edge Computing: The Communication Perspective [J].

Mao, Yuyi ;

You, Changsheng ;

Zhang, Jun ;

Huang, Kaibin ;

Letaief, Khaled B. .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2017, 19 (04) :2322-2358

← 1 2 3 →