Two-Level Scheduling Algorithms for Deep Neural Network Inference in Vehicular Networks

被引：3

作者：

Wu, Yalan ^{[1
,2
]}

Wu, Jigang ^{[3
]}

Yao, Mianyang ^{[3
]}

Liu, Bosheng ^{[3
]}

Chen, Long ^{[3
]}

Lam, Siew Kei ^{[2
]}

机构：

[1] Guangdong Univ Technol, Sch Integrated Circuits, Guangzhou 510006, Peoples R China

[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore

[3] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2023年 / 24卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Vehicular network; two-level task scheduling; DNN inference; quality of computing services; accelerator; RESOURCE-ALLOCATION; EDGE; ACCELERATION;

D O I：

10.1109/TITS.2023.3266795

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

In vehicular networks, task scheduling at the microarchitecture-level and network-level offers tremendous potential to improve the quality of computing services for deep neural network (DNN) inference. However, existing task scheduling works only focus on either one of the two levels, which results in inefficient utilization of computing resources. This paper aims to fill this gap by formulating a two-level scheduling problem for DNN inference tasks in a vehicular network, with an objective of minimizing total weighted sum of response time and energy consumption for all tasks under the following constraints: per task response time, per vehicle energy consumption, per vehicle storage capacity. We first formulate the problem and prove that it is NP-hard. A group transformation based algorithm, called GTA, is proposed. GTA makes scheduling decisions at the network-level using the group transformation based approach, and at the microarchitecture-level using a greedy strategy. In addition, an algorithm, denoted as DRL, is proposed to decrease total weighted sum of response time and energy consumption for all tasks. DRL trains two models with deep reinforcement learning to achieve two-level scheduling. The proposed algorithms are evaluated on a platform consisting of a desktop, Raspberry Pi, Eyeriss, OSM, SUMO, NS-3. Simulation results show that DRL outperforms the state-of-the-art methods for all cases, while the proposed GTA outperforms the state-ofthe-art methods for most cases, in terms of total weighted sum of response time and energy consumption. Compared with four baseline algorithms, GTA and DRL reduce the total weighted sum of response time and energy consumption by 41.49% and 62.38%, on average respectively, for different numbers of tasks.

引用

页码：9324 / 9343

页数：20

共 50 条

[1] Vehicle Speed Prediction by Two-Level Data Driven Models in Vehicular Networks
Jiang, Bingnan
Fei, Yunsi
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (07) : 1793 - 1801
[2] Two-Level Scheduling for Video Transmission over Downlink OFDMA Networks
Tham, Mau-Luen
Chow, Chee-Onn
Xu, Yi-Han
Ramli, Nordin
PLOS ONE, 2016, 11 (02):
[3] QoS-aware Two-level Dynamic Uplink Bandwidth Allocation Algorithms in IEEE 802.16j Based Vehicular Networks
Fei, Ridong
Yang, Kun
Ou, Shumao
Chen, Hsiao-Hwa
WIRELESS PERSONAL COMMUNICATIONS, 2011, 56 (03) : 417 - 433
[4] QoS-aware Two-level Dynamic Uplink Bandwidth Allocation Algorithms in IEEE 802.16j Based Vehicular Networks
Ridong Fei
Kun Yang
Shumao Ou
Hsiao-Hwa Chen
Wireless Personal Communications, 2011, 56 : 417 - 433
[5] Dynamic Early Exit Scheduling for Deep Neural Network Inference through Contextual Bandits
Ju, Weiyu
Bao, Wei
Ge, Liming
Yuan, Dong
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 823 - 832
[6] SAFE: Intelligent Online Scheduling for Collaborative DNN Inference in Vehicular Network
Zhou, Ruiting
Han, Ziyi
Zeng, Yifan
Zhou, Zhi
Wu, Libing
Wang, Wei
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 3230 - 3236
[7] A kind of Two-Level Cooperation Distributed Scheduling Strategy
Ruan, Dongru
Hua, Yu
Pang, Zhifeng
2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 410 - 413
[8] Collaborative Inference for Deep Neural Networks in Edge Environments
Liu, Meizhao
Gu, Yingcheng
Dong, Sen
Wei, Liu
Liu, Kai
Yan, Yuting
Song, Yu
Cheng, Huanyu
Tang, Lei
Zhang, Sheng
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (07): : 1749 - 1773
[9] Optimization of Analog Accelerators for Deep Neural Networks Inference
Fasoli, Andrea
Ambrogio, Stefano
Narayanan, Pritish
Tsai, Hsinyu
Mackin, Charles
Spoon, Katherine
Friz, Alexander
Chen, An
Burr, Geoffrey W.
2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
[10] Task Scheduling and Power Allocation in Multiuser Multiserver Vehicular Networks by NOMA and Deep Reinforcement Learning
Cong, Yuliang
Liu, Maiou
Wang, Cong
Sun, Shuxian
Hu, Fengye
Liu, Zhan
Wang, Chaoying
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (13): : 23532 - 23543

← 1 2 3 4 5 →