Dynamic Deployment of DNN Inference Tasks Based on Distributed Proximal Policy Optimization

被引:0
作者
He, Wenchen [1 ]
Li, Yitao [1 ]
Wang, Liqiang [1 ]
机构
[1] Natl Comp Network Emergency Response Tech Team Co, Beijing 100029, Peoples R China
来源
COMPUTER NETWORKS AND IOT, PT 3, IAIC 2023 | 2024年 / 2060卷
关键词
Edge computing; Task deployment; Reinforcement learning; terminal mobility; RESOURCE-ALLOCATION;
D O I
10.1007/978-981-97-1332-5_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the widespread adoption of edge computing, Deep Neural Networks (DNN) inference tasks are gradually deployed on edge computing nodes. The inference and decision-making process of intelligent services is moved to the edge side, reusing edge resources to provide ubiquitous services. However, during the service process, due to constrained edge resources or factors such as terminal mobility, DNN inference tasks may experience long delays or service interruptions, affecting the timeliness and continuity of the services. To address the problem of deteriorated communication conditions and reduced data transmission efficiency during terminal mobility, which leads to decreased service quality or even interruptions, a dynamic deployment method for DNN inference tasks based on distributed proximal policy optimization (DPPO) is proposed. Building upon an edge-terminal collaborative architecture for dynamic deployment of DNN inference tasks, this method takes into account the terminal's location, communication conditions, and the availability of resources in accessible edge nodes. The process involves DNN model caching, inference computation offloading, as well as communication and computation resource allocation. The experimental results demonstrate that the proposed method can adapt to the dynamic environment of the edge and achieve the integration and on-demand allocation of edge multidimensional resources, effectively ensuring service continuity.
引用
收藏
页码:133 / 143
页数:11
相关论文
共 10 条
[1]   Dynamic Resource Allocation and Computation Offloading for IoT Fog Computing System [J].
Chang, Zheng ;
Liu, Liqing ;
Guo, Xijuan ;
Sheng, Quan .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (05) :3348-3357
[2]   Maximization of Value of Service for Mobile Collaborative Computing Through Situation-Aware Task Offloading [J].
Chen, Ruitao ;
Wang, Xianbin .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (02) :1049-1065
[3]   QoS Driven Task Offloading With Statistical Guarantee in Mobile Edge Computing [J].
Li, Qing ;
Wang, Shangguang ;
Zhou, Ao ;
Ma, Xiao ;
Yang, Fangchun ;
Liu, Alex X. .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (01) :278-290
[4]  
Liu J., 2021, IEEE Trans. Mob. Comput., V1
[5]   Dynamic Computation Offloading and Resource Allocation for Multi-user Mobile Edge Computing [J].
Nath, Samrat ;
Wu, Jingxian .
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[6]   Priority-Aware Task Offloading in Vehicular Fog Computing Based on Deep Reinforcement Learning [J].
Shi, Jinming ;
Du, Jun ;
Wang, Jingjing ;
Wang, Jian ;
Yuan, Jian .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) :16067-16081
[7]   Joint Task Offloading and Resource Allocation for NOMA-Enabled Multi-Access Mobile Edge Computing [J].
Song, Zhengyu ;
Liu, Yuanwei ;
Sun, Xin .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (03) :1548-1564
[8]   Energy-Efficient Joint Task Offloading and Resource Allocation in OFDMA-Based Collaborative Edge Computing [J].
Tan, Lin ;
Kuang, Zhufang ;
Zhao, Lian ;
Liu, Anfeng .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (03) :1960-1972
[9]   COOPERATIVE CONTENT CACHING IN 5G NETWORKS WITH MOBILE EDGE COMPUTING [J].
Zhang, Ke ;
Leng, Supeng ;
He, Yejun ;
Maharjan, Sabita ;
Zhang, Yan .
IEEE WIRELESS COMMUNICATIONS, 2018, 25 (03) :80-87
[10]   Dynamic Computation Offloading for Mobile Cloud Computing: A Stochastic Game-Theoretic Approach [J].
Zheng, Jianchao ;
Cai, Yueming ;
Wu, Yuan ;
Shen, Xuemin .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2019, 18 (04) :771-786