Reinforcement-Learning- and Belief-Learning-Based Double Auction Mechanism for Edge Computing Resource Allocation

被引：46

作者：

Li, Quanyi ^{[1
]}

Yao, Haipeng ^{[1
]}

Mai, Tianle ^{[1
]}

Jiang, Chunxiao ^{[2
]}

Zhang, Yan ^{[3
]}

机构：

[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100876, Peoples R China

[2] Tsinghua Univ, Tsinghua Space Ctr, Beijing 100084, Peoples R China

[3] Univ Oslo, Dept Informat, N-0316 Oslo, Norway

来源：

IEEE INTERNET OF THINGS JOURNAL | 2020年 / 7卷 / 07期

关键词：

Task analysis; Resource management; Games; Cloud computing; Heuristic algorithms; Reinforcement learning; Double auction game; experience-weighted attraction (EWA); latency-sensitive businesses; mobile edge computing (MEC); MARKET; GAMES;

D O I：

10.1109/JIOT.2019.2953108

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, we have witnessed the compelling application of the Internet of Things (IoT) in our daily life, ranging from daily living to industrial production. On account of the computation and power constraints, the IoT devices have to offload their tasks to the remote cloud services. However, the long-distance transmission poses significant challenges for latency-sensitive businesses, such as autonomous driving and industrial control. As a remedy, mobile edge computing (MEC) is deployed at the edge of the network to reduce the transmission delay. With the MEC joining in, how to allocate the limited computing resource of MEC is a critical problem to guarantee efficient working of the whole IoT system. In this article, we formulate the resource management among MEC and IoT devices as a double auction game. Also, for searching the Nash equilibrium, we introduce the experience-weighted attraction (EWA) algorithm performing behind each participant. With this AI method, auction participants acquire and accumulate experience by observing others' behavior and doing introspection, which accelerates the trading policy's learning process of each agent in such an opaque environment. Some simulation results are presented to evaluate the convergence and correctness of our architecture and algorithm.

引用

页码：5976 / 5985

页数：10

共 31 条

[1]

[Anonymous], [No title captured]

[2]

[Anonymous], 2019, ARXIV

[3] Experience-weighted attraction learning in coordination games: Probability rules, heterogeneity, and time-variation [J].

Camerer, C ;

Ho, TH .

JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1998, 42 (2-3) :305-326

[4]

Camerer C., 1997, 1003 CALTECH DIV HUM

[5] When UAV Swarm Meets Edge-Cloud Computing: The QoS Perspective [J].

Chen, Wuhui ;

Liu, Baichuan ;

Huang, Huawei ;

Guo, Song ;

Meng, Zibin .

IEEE NETWORK, 2019, 33 (02) :36-43

[6]

Dawei Sun, 2010, 2010 International Conference on Computer Design and Applications (ICCDA 2010), P94, DOI 10.1109/ICCDA.2010.5541088

[7] Double Auction Mechanism Design for Video Caching in Heterogeneous Ultra-Dense Networks [J].

Du, Jun ;

Jiang, Chunxiao ;

Gelenbe, Erol ;

Zhang, Haijun ;

Ren, Yong ;

Quek, Tony Q. S. .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (03) :1669-1683

[8]

FRIEDMAN D, 1993, SFI S SCI C, V14, P3

[9] A Double-Auction Mechanism for Mobile Data-Offloading Markets [J].

Iosifidis, George ;

Gao, Lin ;

Huang, Jianwei ;

Tassiulas, Leandros .

IEEE-ACM TRANSACTIONS ON NETWORKING, 2015, 23 (05) :1634-1647

[10] Dynamic Pricing and Energy Consumption Scheduling With Reinforcement Learning [J].

Kim, Byung-Gook ;

Zhang, Yu ;

van der Schaar, Mihaela ;

Lee, Jang-Won .

IEEE TRANSACTIONS ON SMART GRID, 2016, 7 (05) :2187-2198

← 1 2 3 4 →