CA-DTS: A Distributed and Collaborative Task Scheduling Algorithm for Edge Computing Enabled Intelligent Road Network

被引：6

作者：

Hu, Shi-Hong ^{[1
,2
]}

Luo, Qu-Yuan ^{[3
]}

Li, Guang-Hui ^{[4
]}

Shi, Weisong ^{[5
]}

Ye, Bao-Liu ^{[2
,6
]}

机构：

[1] Houhai Univ, Minist Water Resources, Key Lab Water Big Data Technol, Nanjing 210098, Peoples R China

[2] Houhai Univ, Sch Comp & Informat, Nanjing 210098, Peoples R China

[3] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Peoples R China

[4] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Peoples R China

[5] Univ Delaware, Dept Comp & Informat Sci, Newark, DE 19716 USA

[6] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210093, Peoples R China

来源：

JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY | 2023年 / 38卷 / 05期

基金：

中国国家自然科学基金;

关键词：

edge computing; deep reinforcement learning; task scheduling; vehicular edge computing; RESOURCE-ALLOCATION; REINFORCEMENT; TIME;

D O I：

10.1007/s11390-023-2839-0

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Edge computing enabled Intelligent Road Network (EC-IRN) provides powerful and convenient computing services for vehicles and roadside sensing devices. The continuous emergence of transportation applications has caused a huge burden on roadside units (RSUs) equipped with edge servers in the Intelligent Road Network (IRN). Collaborative task scheduling among RSUs is an effective way to solve this problem. However, it is challenging to achieve collaborative scheduling among different RSUs in a completely decentralized environment. In this paper, we first model the interactions involved in task scheduling among distributed RSUs as a Markov game. Given that multi-agent deep reinforcement learning (MADRL) is a promising approach for the Markov game in decision optimization, we propose a collaborative task scheduling algorithm based on MADRL for EC-IRN, named CA-DTS, aiming to minimize the long-term average delay of tasks. To reduce the training costs caused by trial-and-error, CA-DTS specially designs a reward function and utilizes the distributed deployment and collective training architecture of counterfactual multi-agent policy gradient (COMA). To improve the stability of performance in large-scale environments, CA-DTS takes advantage of the action semantics network (ASN) to facilitate cooperation among multiple RSUs. The evaluation results of both the testbed and simulation demonstrate the effectiveness of our proposed algorithm. Compared with the baselines, CA-DTS can achieve convergence about 35% faster, and obtain average task delay that is lower by approximately 9.4%, 9.8%, and 6.7%, in different scenarios with varying numbers of RSUs, service types, and task arrival rates, respectively.

引用

页码：1113 / 1131

页数：19

共 36 条

[1] Resource Allocation and Service Provisioning in Multi-Agent Cloud Robotics: A Comprehensive Survey [J].

Afrin, Mahbuba ;

Jin, Jiong ;

Rahman, Akhlaqur ;

Rahman, Ashfaqur ;

Wan, Jiafu ;

Hossain, Ekram .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (02) :842-870

[2] On-Edge Multi-Task Transfer Learning: Model and Practice With Data-Driven Task Allocation [J].

Chen, Qiong ;

Zheng, Zimu ;

Hu, Chuang ;

Wang, Dan ;

Liu, Fangming .

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (06) :1357-1371

[3] Optimal Admission Control Mechanism Design for Time-Sensitive Services in Edge Computing [J].

Chen, Shutong ;

Wang, Lin ;

Liu, Fangming .

IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2022), 2022, :1169-1178

[4]

Chen SB, 2022, Arxiv, DOI arXiv:2208.14052

[5] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks [J].

Cui, Jingjing ;

Liu, Yuanwei ;

Nallanathan, Arumugam .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) :729-743

[6]

Filar J., 1996, Competitive Markov Decision Processes, DOI DOI 10.1007/978-1-4612-4054-9

[7]

Foerster J., 2018, Counterfactual multi-agent policy gradients. In Proc. the 32nd AAAI Conference on, DOI [10.1609/aaai.v32i1.11794, DOI 10.1609/AAAI.V32I1.11794]

[8] 3D Detection and Pose Estimation of Vehicle in Cooperative Vehicle Infrastructure System [J].

Guo, Ente ;

Chen, Zhifeng ;

Rahardja, Susanto ;

Yang, Jingjing .

IEEE SENSORS JOURNAL, 2021, 21 (19) :21759-21771

[9] Multi-Agent Deep Reinforcement Learning for Computation Offloading and Interference Coordination in Small Cell Networks [J].

Huang, Xiaoyan ;

Leng, Supeng ;

Maharjan, Sabita ;

Zhang, Yan .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (09) :9282-9293

[10] UTILIZATION OF IDLE TIME IN AN M-G-1 QUEUING SYSTEM [J].

LEVY, Y ;

YECHIALI, U .

MANAGEMENT SCIENCE, 1975, 22 (02) :202-211

← 1 2 3 4 →