Incentive-Aware Partitioning and Offloading Scheme for Inference Services in Edge Computing

被引:0
作者
Kim, TaeYoung [1 ]
Kim, Chang Kyung [1 ]
Lee, Seung-seob [2 ]
Lee, Sukyoung [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul 03722, South Korea
[2] Yale Univ, New Haven, CT 06520 USA
基金
新加坡国家研究基金会;
关键词
Task analysis; Delays; Computational modeling; Servers; Edge computing; Games; Artificial neural networks; Incentive; DNN Partitioning; offloading; scheduling; utility; energy; inference delay; game theory; RESOURCE-ALLOCATION; DRIVEN; DEPLOYMENT; MECHANISM;
D O I
10.1109/TSC.2024.3359148
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Owing to remarkable improvements in deep neural networks (DNNs), various computation-intensive and delay-sensitive DNN services have been developed for smart IoT devices. However, employing these services on the devices is challenging due to their limited battery capacity and computational constraints. Although edge computing is proposed as a solution, edge devices cannot meet the performance requirements of DNN services because the majority of IoT applications require simultaneous inference services, and DNN models grow larger. To address this problem, we propose a framework that enables parallel execution of partitioned and offloaded DNN inference services over multiple distributed edge devices. Noteworthy, edge devices are reluctant to process tasks due to their energy consumption. Thus, to provide an incentive mechanism for edge devices, we model the interaction between the edge devices and DNN inference service users as a two-level Stackelberg game. Based on this model, we design the proposed framework to determine the optimal scheduling with a partitioning strategy, aiming to maximize user satisfaction while incentivizing the participation of edge devices. We further derive the Nash equilibrium points in the two levels. The simulation results show that the proposed scheme outperforms other benchmark methods in terms of user satisfaction and profits of edge devices.
引用
收藏
页码:1580 / 1592
页数:13
相关论文
共 39 条
[1]   BARA: A blockchain-aided auction-based resource allocation in edge computing enabled industrial internet of things [J].
Baranwal, Gaurav ;
Kumar, Dinesh ;
Vidyarthi, Deo Prakash .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 :333-347
[2]   Green Parallel Online Offloading for DSCI-Type Tasks in IoT-Edge Systems [J].
Chen, Junqi ;
Wu, Huaming ;
Li, Ruidong ;
Jiao, Pengfei .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (11) :7955-7966
[3]   DNNOff: Offloading DNN-Based Intelligent IoT Applications in Mobile Edge Computing [J].
Chen, Xing ;
Li, Ming ;
Zhong, Hao ;
Ma, Yun ;
Hsu, Ching-Hsien .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (04) :2820-2829
[4]   Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments [J].
Chen, Xing ;
Zhang, Jianshan ;
Lin, Bing ;
Chen, Zheyi ;
Wolter, Katinka ;
Min, Geyong .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (03) :683-697
[5]   Multi-server Multi-user Game at Edges for Heterogeneous Video Analytics [J].
Chen, Yu ;
Zhang, Sheng ;
Jin, Yibo ;
Qian, Zhuzhong ;
Lu, Sanglu .
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, :841-846
[6]   Incentive-Driven Proactive Application Deployment and Pricing on Distributed Edges [J].
Deng, Shuiguang ;
Chen, Yishan ;
Chen, Gong ;
Ji, Shouling ;
Yin, Jianwei ;
Zomaya, Albert Y. .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (02) :951-967
[7]   Deep Reinforcement Learning for Trajectory Path Planning and Distributed Inference in Resource-Constrained UAV Swarms [J].
Dhuheir, Marwan ;
Baccour, Emna ;
Erbad, Aiman ;
Al-Obaidi, Sinan Sabeeh ;
Hamdi, Mounir .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (09) :8185-8201
[8]   Incentive Mechanism and Resource Allocation for Edge-Fog Networks Driven by Multi-Dimensional Contract and Game Theories [J].
Diamanti, Maria ;
Charatsaris, Panagiotis ;
Tsiropoulou, Eirini Eleni ;
Papavassiliou, Symeon .
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2022, 3 :435-452
[9]   Joint DNN Partition and Resource Allocation for Task Offloading in Edge-Cloud-Assisted IoT Environments [J].
Fan, Wenhao ;
Gao, Li ;
Su, Yi ;
Wu, Fan ;
Liu, Yuan'an .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (12) :10146-10159
[10]   DNN Deployment, Task Offloading, and Resource Allocation for Joint Task Inference in IIoT [J].
Fan, Wenhao ;
Chen, Zeyu ;
Hao, Zhibo ;
Su, Yi ;
Wu, Fan ;
Tang, Bihua ;
Liu, Yuan'an .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (02) :1634-1646