Learning for Computation Offloading in Mobile Edge Computing

被引:169
作者
Dinh, Thinh Quang [1 ]
La, Quang Duy [1 ]
Quek, Tony Q. S. [1 ,2 ]
Shin, Hyundong [2 ]
机构
[1] Singapore Univ Technol & Design, Singapore 487372, Singapore
[2] Kyung Hee Univ, Dept Elect Engn, Yongin 17104, South Korea
基金
新加坡国家研究基金会;
关键词
Mobile edge computing; computation offloading; Q-learning; exact potential game; unknown noisy payoff game; strategy learning; RESOURCE-ALLOCATION; JOINT OPTIMIZATION; POWER-CONTROL; CLOUD; EXECUTION; EVOLUTION; RADIO;
D O I
10.1109/TCOMM.2018.2866572
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Mobile edge computing (MEC) is expected to provide cloud-like capacities for mobile users (MUs) at the edge of wireless networks. However, deploying MEC systems faces many challenges, one of which is to achieve an efficient distributed offloading mechanism for multiple users in time-varying wireless environments. In this paper, we study a multi-user multi-edgenode computation offloading problem. Since edge nodes' communication and computing capacities are limited which leads resource contention when many MUs offload to the same edge node at the same time, we formulate this problem as a noncooperative exact potential game (EPG), where each MU, in each time slot, selfishly maximizes its number of processed central processor unit (CPU) cycles and reduces its energy consumption. Assuming that channel information is static and available to MUs, we show that MUs could achieve a Nash equilibrium via a best response-based offloading mechanism. Next, we extend the problem to a practical scenario, where the number of processed CPU cycles is time-varying and unknown to MUs because of the uncertain channel information. In this case, we adopt an unknown payoff game framework and prove that the EPG properties still hold. Then, we propose a model-free reinforcement learning offloading mechanism which helps MUs learn their long-term offloading strategies to maximize their long-term utilities. Numerical results illustrate that our proposed algorithm for unknown CSI outperforms other schemes, such as local processing and random assignment, and achieves up to 87.87% average long-term payoffs compared to the perfect CSI case.
引用
收藏
页码:6353 / 6367
页数:15
相关论文
共 39 条
[1]  
Alpcan T, 2006, IEEE T WIREL COMMUN, V5, P890, DOI [10.1109/TWC.2006.1618938, 10.1109/TWC.2006.04023]
[2]   ERGODICITY IN PARAMETRIC NONSTATIONARY MARKOV-CHAINS - AN APPLICATION TO SIMULATED ANNEALING METHODS [J].
ANILY, S ;
FEDERGRUEN, A .
OPERATIONS RESEARCH, 1987, 35 (06) :867-874
[3]  
[Anonymous], 2010, P ACM MOBISYS, DOI [10.1145/1814433.1814441, DOI 10.1145/1814433.1814441]
[4]  
[Anonymous], P IEEE WCNC
[5]  
[Anonymous], 2015, Ericsson Mobility Report
[6]   LTE: The Evolution of Mobile Broadband [J].
Astely, David ;
Dahlman, Erik ;
Furuskar, Anders ;
Jading, Ylva ;
Lindstrom, Magnus ;
Parkvall, Stefan .
IEEE COMMUNICATIONS MAGAZINE, 2009, 47 (04) :44-51
[7]   CONVERGENT LEARNING ALGORITHMS FOR UNKNOWN REWARD GAMES [J].
Chapman, Archie C. ;
Leslie, David S. ;
Rogers, Alex ;
Jennings, Nicholas R. .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2013, 51 (04) :3154-3180
[8]   Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing [J].
Chen, Xu ;
Jiao, Lei ;
Li, Wenzhong ;
Fu, Xiaoming .
IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (05) :2827-2840
[9]   Decentralized Computation Offloading Game for Mobile Cloud Computing [J].
Chen, Xu .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (04) :974-983
[10]  
Chun BG, 2011, EUROSYS 11: PROCEEDINGS OF THE EUROSYS 2011 CONFERENCE, P301