Two-Level Scheduling Algorithms for Deep Neural Network Inference in Vehicular Networks

被引:3
|
作者
Wu, Yalan [1 ,2 ]
Wu, Jigang [3 ]
Yao, Mianyang [3 ]
Liu, Bosheng [3 ]
Chen, Long [3 ]
Lam, Siew Kei [2 ]
机构
[1] Guangdong Univ Technol, Sch Integrated Circuits, Guangzhou 510006, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[3] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Vehicular network; two-level task scheduling; DNN inference; quality of computing services; accelerator; RESOURCE-ALLOCATION; EDGE; ACCELERATION;
D O I
10.1109/TITS.2023.3266795
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
In vehicular networks, task scheduling at the microarchitecture-level and network-level offers tremendous potential to improve the quality of computing services for deep neural network (DNN) inference. However, existing task scheduling works only focus on either one of the two levels, which results in inefficient utilization of computing resources. This paper aims to fill this gap by formulating a two-level scheduling problem for DNN inference tasks in a vehicular network, with an objective of minimizing total weighted sum of response time and energy consumption for all tasks under the following constraints: per task response time, per vehicle energy consumption, per vehicle storage capacity. We first formulate the problem and prove that it is NP-hard. A group transformation based algorithm, called GTA, is proposed. GTA makes scheduling decisions at the network-level using the group transformation based approach, and at the microarchitecture-level using a greedy strategy. In addition, an algorithm, denoted as DRL, is proposed to decrease total weighted sum of response time and energy consumption for all tasks. DRL trains two models with deep reinforcement learning to achieve two-level scheduling. The proposed algorithms are evaluated on a platform consisting of a desktop, Raspberry Pi, Eyeriss, OSM, SUMO, NS-3. Simulation results show that DRL outperforms the state-of-the-art methods for all cases, while the proposed GTA outperforms the state-ofthe-art methods for most cases, in terms of total weighted sum of response time and energy consumption. Compared with four baseline algorithms, GTA and DRL reduce the total weighted sum of response time and energy consumption by 41.49% and 62.38%, on average respectively, for different numbers of tasks.
引用
收藏
页码:9324 / 9343
页数:20
相关论文
共 50 条
  • [1] Vehicle Speed Prediction by Two-Level Data Driven Models in Vehicular Networks
    Jiang, Bingnan
    Fei, Yunsi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2017, 18 (07) : 1793 - 1801
  • [2] Two-Level Scheduling for Video Transmission over Downlink OFDMA Networks
    Tham, Mau-Luen
    Chow, Chee-Onn
    Xu, Yi-Han
    Ramli, Nordin
    PLOS ONE, 2016, 11 (02):
  • [3] QoS-aware Two-level Dynamic Uplink Bandwidth Allocation Algorithms in IEEE 802.16j Based Vehicular Networks
    Fei, Ridong
    Yang, Kun
    Ou, Shumao
    Chen, Hsiao-Hwa
    WIRELESS PERSONAL COMMUNICATIONS, 2011, 56 (03) : 417 - 433
  • [4] QoS-aware Two-level Dynamic Uplink Bandwidth Allocation Algorithms in IEEE 802.16j Based Vehicular Networks
    Ridong Fei
    Kun Yang
    Shumao Ou
    Hsiao-Hwa Chen
    Wireless Personal Communications, 2011, 56 : 417 - 433
  • [5] Dynamic Early Exit Scheduling for Deep Neural Network Inference through Contextual Bandits
    Ju, Weiyu
    Bao, Wei
    Ge, Liming
    Yuan, Dong
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 823 - 832
  • [6] SAFE: Intelligent Online Scheduling for Collaborative DNN Inference in Vehicular Network
    Zhou, Ruiting
    Han, Ziyi
    Zeng, Yifan
    Zhou, Zhi
    Wu, Libing
    Wang, Wei
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 3230 - 3236
  • [7] A kind of Two-Level Cooperation Distributed Scheduling Strategy
    Ruan, Dongru
    Hua, Yu
    Pang, Zhifeng
    2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 410 - 413
  • [8] Collaborative Inference for Deep Neural Networks in Edge Environments
    Liu, Meizhao
    Gu, Yingcheng
    Dong, Sen
    Wei, Liu
    Liu, Kai
    Yan, Yuting
    Song, Yu
    Cheng, Huanyu
    Tang, Lei
    Zhang, Sheng
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (07): : 1749 - 1773
  • [9] Optimization of Analog Accelerators for Deep Neural Networks Inference
    Fasoli, Andrea
    Ambrogio, Stefano
    Narayanan, Pritish
    Tsai, Hsinyu
    Mackin, Charles
    Spoon, Katherine
    Friz, Alexander
    Chen, An
    Burr, Geoffrey W.
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [10] Task Scheduling and Power Allocation in Multiuser Multiserver Vehicular Networks by NOMA and Deep Reinforcement Learning
    Cong, Yuliang
    Liu, Maiou
    Wang, Cong
    Sun, Shuxian
    Hu, Fengye
    Liu, Zhan
    Wang, Chaoying
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (13): : 23532 - 23543