An Efficient Scheduling Algorithm for Multi-mode Tasks on Near-Data Processing SSDs

被引:0
作者
Li, Guo [1 ]
Chen, Xianzhang [1 ]
Liu, Duo [1 ]
Li, Jiali [1 ]
Tan, Yujuan [1 ]
Ren, Ao [1 ]
机构
[1] Chongqing Univ, Chongqing 400044, Peoples R China
来源
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT VII | 2024年 / 14493卷
基金
中国国家自然科学基金;
关键词
Near data processing; Computational storage; Multi-mode task scheduling;
D O I
10.1007/978-981-97-0862-8_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Near-Data Processing (NDP) architectures have been proposed to alleviate the large overhead of data movement between the host and the Computational Storage Device (CSD) by offloading tasks to the CSD. In NDP architectures, each task can run in multiple modes according to the resource it takes for computing, such as the CPU of the host, the accelerator or the processor of the CSD. However, existing task scheduling algorithms on NDP architectures are unaware of the multi-mode tasks, leading to increased completion time of tasks and low resource utilization. In this paper, we propose a Multi-Mode Task Scheduling (MMTS) algorithm to optimize the completion time of the multi-mode tasks in NDP architectures. MMTS employs a greedy strategy to fully use the computing resources in the host and the CSD and align the completion time of the tasks by picking the proper modes. Our experimental results show that MMTS achieves 20.6% performance improvement on average over the state-of-the-art task scheduling algorithm on NDP-based system.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 14 条
[1]  
[Anonymous], 2013, JBER, DOI DOI 10.19030/JBER.V11I11.8193
[2]   Cost-effective, Energy-efficient, and Scalable Storage Computing for Large-scale AI Applications [J].
Do, Jaeyoung ;
Ferreira, Victor C. ;
Bobarshad, Hossein ;
Torabzadehkashi, Mahdi ;
Rezaei, Siavash ;
Heydarigorji, Ali ;
Souza, Diego ;
Goldstein, Brunno F. ;
Santiago, Leandro ;
Kim, Min Soo ;
Lima, Priscila M., V ;
Franca, Felipe M. G. ;
Alves, Vladimir .
ACM TRANSACTIONS ON STORAGE, 2020, 16 (04)
[3]   Stannis: Low-Power Acceleration of DNN Training Using Computational Storage Devices [J].
HeydariGorji, Ali ;
Torabzadehkashi, Mahdi ;
Rezaei, Siavash ;
Bobarshad, Hossein ;
Alves, Vladimir ;
Chou, Pai H. .
PROCEEDINGS OF THE 2020 57TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2020,
[4]   A GENETIC ALGORITHM FOR MULTIPROCESSOR SCHEDULING [J].
HOU, ESH ;
ANSARI, N ;
REN, H .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (02) :113-120
[5]   Exploring and Exploiting the Multilevel Parallelism Inside SSDs for Improved Performance and Endurance [J].
Hu, Yang ;
Jiang, Hong ;
Feng, Dan ;
Tian, Lei ;
Luo, Hao ;
Ren, Chao .
IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (06) :1141-1155
[6]   Summarizer: Trading Communication with Computing Near Storage [J].
Koo, Gunjae ;
Matam, Kiran Kumar ;
Te, I ;
Narra, H. V. Krishna Giri ;
Li, Jing ;
Tseng, Hung-Wei ;
Swanson, Steven ;
Annavaram, Murali .
50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2017, :219-231
[7]   Cosmos plus OpenSSD: Rapid Prototype for Flash Storage Systems [J].
Kwak, Jaewook ;
Lee, Sangjin ;
Park, Kibin ;
Jeong, Jinwoo ;
Song, Yong Ho .
ACM TRANSACTIONS ON STORAGE, 2020, 16 (03)
[8]   Horae: A Hybrid I/O Request Scheduling Technique for Near-Data Processing-Based SSD [J].
Li, Jiali ;
Chen, Xianzhang ;
Liu, Duo ;
Li, Lin ;
Wang, Jiapin ;
Zeng, Zhaoyang ;
Tan, Yujuan ;
Qiao, Lei .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) :3803-3813
[9]  
Liang SW, 2019, PROCEEDINGS OF THE 2019 USENIX ANNUAL TECHNICAL CONFERENCE, P395
[10]  
Reinsel D., 2018, DIGITALIZATION WORLD