RAISE: Efficient GPU Resource Management via Hybrid Scheduling

被引:2
作者
Weng, Yue [1 ]
Ge, Tianao [1 ]
Zhang, Xi [1 ]
Zhang, Xianwei [1 ]
Lu, Yutong [1 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
来源
2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022) | 2022年
基金
中国国家自然科学基金;
关键词
Resource management; GPU; Scheduling; MULTITASKING; PREEMPTION;
D O I
10.1109/CCGrid54584.2022.00078
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the de facto high-throughput accelerators, graphics processing units (GPUs) are now used in a wide spectrum of fields, including artificial intelligence, high performance computing and finance. While with excessive computing and memory resources, GPUs are facing significant challenges to reach high utilization by a monolithic task. Multiple tasks are thus concurrently running to share the GPUs, but they may adversely affect each other, causing performance degradation. As a result, it is extremely critical to manage resources in a reasonable way to strike a balance between utilization and performance. Targeting the issue, this paper proposes an effective resource management design via hybrid task scheduling. Our design continuously tracks the GPU executions and collects the usage statistics, which are then used to direct the task selection and dispatch, including the type, starting time and kernel dimensions. A prototype is developed on off-the-shelf GPUs by moderately refactoring the CUDA source codes. Experimental results show that the design can achieve up to 1.96x performance improvement (1.51x on average), meanwhile effectively boosting resource utilization.
引用
收藏
页码:685 / 694
页数:10
相关论文
共 43 条
[1]  
Adriaens JT, 2012, INT S HIGH PERF COMP, P79
[2]  
Aguilera P, 2014, PR IEEE COMP DESIGN, P440, DOI 10.1109/ICCD.2014.6974717
[3]  
AMD, AMD INST MI200
[4]  
AMD, AMD HIP GUID
[5]  
AMD, AMD POL GPU ARCH WHI
[6]  
[Anonymous], 2016, Top500 supercomputer
[7]  
[Anonymous], 2012, P USENIX ANN TECH C
[8]  
[Anonymous], OPENCL
[9]   Understanding GPU Power: A Survey of Profiling, Modeling, and Simulation Methods [J].
Bridges, Robert A. ;
Imam, Neena ;
Mintz, Tiffany M. .
ACM COMPUTING SURVEYS, 2016, 49 (03)
[10]  
Che SA, 2009, I S WORKL CHAR PROC, P44, DOI 10.1109/IISWC.2009.5306797