Asymmetry-aware load balancing for parallel applications in single-ISA multi-core systems

被引:0
作者
Eunsung Kim
Hyeonsang Eom
Heon Y. Yeom
机构
[1] Seoul National University,School of Computer Science and Engineering
来源
Journal of Zhejiang University SCIENCE C | 2012年 / 13卷
关键词
Scheduler; Load balancing; Capability asymmetry; OS noise; Multi-core; TP316;
D O I
暂无
中图分类号
学科分类号
摘要
Contemporary operating systems for single-ISA (instruction set architecture) multi-core systems attempt to distribute tasks equally among all the CPUs. This approach works relatively well when there is no difference in CPU capability. However, there are cases in which CPU capability differs from one another. For instance, static capability asymmetry results from the advent of new asymmetric hardware, and dynamic capability asymmetry comes from the operating system (OS) outside noise caused from networking or I/O handling. These asymmetries can make it hard for the OS scheduler to evenly distribute the tasks, resulting in less efficient load balancing. In this paper, we propose a user-level load balancer for parallel applications, called the’ capability balancer’, which recognizes the difference of CPU capability and makes subtasks share the entire CPU capability fairly. The balancer can coexist with the existing kernel-level load balancer without detrimenting the behavior of the kernel balancer. The capability balancer can fairly distribute CPU capability to tasks with very little overhead. For real workloads like the NAS Parallel Benchmark (NPB), we have accomplished speedups of up to 9.8% and 8.5% in dynamic and static asymmetries, respectively. We have also experienced speedups of 13.3% for dynamic asymmetry and 24.1% for static asymmetry in a competitive environment. The impacts of our task selection policies, FIFO (first in, first out) and cache, were compared. The use of the cache policy led to a speedup of 5.3% in overall execution time and a decrease of 4.7% in the overall cache miss count, compared with the FIFO policy, which is used by default.
引用
收藏
页码:413 / 427
页数:14
相关论文
共 34 条
[1]  
Asanovic K.(2009)A view of the parallel computing landscape Commun. ACM 52 56-67
[2]  
Bodik R.(1991)The NAS Parallel Benchmarks Int. J. Supercomput. Appl. 5 63-73
[3]  
Demmel J.(2006)Operating system issues for petascale systems ACM SIGOPS Oper. Syst. Rev. 40 29-33
[4]  
Keaveny T.(2008)Benchmarking the effects of operating system interference on extreme-scale parallel machines Cluster Comput. 11 3-16
[5]  
Keutzer K.(1989)Analysis and simulation of a fair queueing algorithm ACM SIGCOMM Comput. Commun. Rev. 19 1-12
[6]  
Kubiatowicz J.(2008)Amdahl’s law in the multicore era IEEE Comput. 41 33-38
[7]  
Morgan N.(undefined)undefined undefined undefined undefined-undefined
[8]  
Patterson D.(undefined)undefined undefined undefined undefined-undefined
[9]  
Sen K.(undefined)undefined undefined undefined undefined-undefined
[10]  
Wawrzynek J.(undefined)undefined undefined undefined undefined-undefined