[2] Univ Toulouse, CNRS, INSA, LAAS, Toulouse, France
Source:
2023 IEEE 29TH REAL-TIME AND EMBEDDED TECHNOLOGY AND APPLICATIONS SYMPOSIUM, RTAS
|
2023
Keywords:
Response Time Analysis;
Non-preemptive;
Gang Scheduling;
Tensor Processing Unit;
PRIORITY ASSIGNMENT;
TIME;
PERFORMANCE;
PROGRESS;
DOI:
10.1109/RTAS58335.2023.00019
Chinese Library Classification:
TP3 [Computing technology, computer technology];
Discipline classification code:
0812;
Abstract:
Non-preemptive rigid gang scheduling combines the performance benefits of parallel execution with the low overhead of non-preemptive scheduling and the rigid task programming model. This approach appears particularly well-suited for parallel hardware accelerators where the context switch and migration overheads are critical and should be avoided. One of the most notable examples today is Google's Edge Tensor Processing Unit (TPU) used for neural network inference on embedded boards. The paper studies sporadic non-preemptive rigid gang scheduling applied to multi-TPU edge AI accelerators. Each gang task spawns a fixed number of threads that must execute simultaneously on distinct processing units. We consider non-preemptive fixed-priority gang (NP-FP-Gang) scheduling and propose the first carry-in limitation for gang task response time analysis. The gang task carry-in limitation differs from that of conventional sequential tasks due to the intra-task parallelism. We formulate it as a generalized knapsack problem and develop a linear programming relaxation and a dynamic programming approach to solve the problem with different time complexities. The performance of the proposed schedulability analysis is evaluated through randomly generated synthetic task sets and a case study using neural network benchmarks executed on commercial off-the-shelf multi-TPU edge AI accelerators. The evaluation results show that the proposed response time analysis effectively improves on the state-of-the-art NP-FP-Gang schedulability test, by as much as 85.7% for the Edge TPU benchmarks in particular.