Stochastic-Stackelberg-Game-Based Edge Service Selection for Massive IoT Networks

被引：4

作者：

Liang, Hui ^{[1
,2
]}

Zhang, Wei ^{[3
,4
]}

机构：

[1] Dongguan Univ Technol, Sch Elect Engn & Intelligentizat, Dongguan 523808, Guangdong, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518055, Guangdong, Peoples R China

[3] Zaozhuang Univ, Coll Optoelect Engn, Zaozhuang 277160, Shandong, Peoples R China

[4] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia

来源：

IEEE INTERNET OF THINGS JOURNAL | 2023年 / 10卷 / 24期

关键词：

Bayesian deep Q-learning network (BDQN); Bayesian neural network (BNN); mobile edge computing (MEC); partially observable Markov decision process (POMDP); stochastic Stackelberg game; RANDOM-ACCESS;

D O I：

10.1109/JIOT.2023.3303480

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Mobile edge computing is a promising technique to provide timely edge service for the Internet of Things. With a continuously expanding network scale, the number and types of devices in the IoT network are growing rapidly. Moreover, the total number of devices that need edge service actually in the network is uncertain because the devices are online or offline frequently. It is very critical to design an efficient service selection strategy for every device to access edge service providers (ESPs) in such a huge number of devices situation. To address the problem, this article first formulates the access of IoT devices as a stochastic Stackelberg game model. Then, for the edge service selection of devices, a Poisson game is utilized to model the coordinated selection of edge services, we propose a distributed iterative algorithm to solve this game and give the optimal edge service selection strategy for each device group. Next, for the noncooperative competition among ESPs with incomplete information, the service prices of other ESPs are first estimated through a Bayesian neural network (BNN). Then we model this stochastic noncooperative game as a partially observable Markov decision process (POMDP) model, and finally obtain the optimal long-term pricing strategy through Bayesian deep Q -learning network (BDQN) algorithm. The experimental simulation results show that the proposed strategy can significantly improve the utilities of both ESPs and devices and effectively decrease the average queuing probability of all devices.

引用

页码：22080 / 22095

页数：16

共 38 条

[1] Random Access for M2M Communications With QoS Guarantees [J].

Abbas, Rana ;

Shirvanimoghaddam, Mahyar ;

Li, Yonghui ;

Vucetic, Branka .

IEEE TRANSACTIONS ON COMMUNICATIONS, 2017, 65 (07) :2889-2903

[2]

[Anonymous], 2011, Study RAN Improvements for Machine-Type Communications, document V11.0.0

[3]

[Anonymous], 2019, 5G for Connected Industries and Automation

[4] Distributed Dynamic Resource Management and Pricing in the IoT Systems With Blockchain-as-a-Service and UAV-Enabled Mobile Edge Computing [J].

Asheralieva, Alia ;

Niyato, Dusit .

IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (03) :1974-1993

[5]

Bazaraa M.S., 2006, Nonlinear Programming: Theory and Algorithms

[6] Adaptive Performance Modeling Framework for QoS-Aware Offloading in MEC-Based IIoT Systems [J].

Bebortta, Sujit ;

Senapati, Dilip ;

Panigrahi, Chhabi Rani ;

Pati, Bibudhendu .

IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (12) :10162-10171

[7]

Bertsekas D., 2009, Convex optimization theory

[8]

Borwein Jonathan, 2006, Convex Analysis and Nonlinear Optimization

[9] A Comprehensive Distributed Queue-Based Random Access Framework for mMTC in LTE/LTE-A Networks With Mixed-Type Traffic [J].

Bui, Anh-Tuan H. ;

Nguyen, Chuyen T. ;

Truong Cong Thang ;

Pham, Anh T. .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (12) :12107-12120

[10] Traffic-Aware Sensor Grouping for IEEE 802.11ah Networks: Regression Based Analysis and Design [J].

Chang, Tung-Chun ;

Lin, Chi-Han ;

Lin, Kate Ching-Ju ;

Chen, Wen-Tsuen .

IEEE TRANSACTIONS ON MOBILE COMPUTING, 2019, 18 (03) :674-687

← 1 2 3 4 →