Hardware-aware thread scheduling: the case of asymmetric multicore processors

被引:1
作者
Peternier, Achille [1 ]
Ansaloni, Danilo [1 ]
Bonetta, Daniele [1 ]
Pautasso, Cesare [1 ]
Binder, Walter [1 ]
机构
[1] Univ Lugano USI, Lugano, Switzerland
来源
PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012) | 2012年
关键词
multicore; workload characterization; asymmetric processors; performance;
D O I
10.1109/ICPADS.2012.62
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Modern processor architectures are increasingly complex and heterogeneous, often requiring solutions tailored to the specific characteristics of each processor model. In this paper we address this problem by targeting the AMD Bulldozer processor as case study for specific hardware-oriented performance optimizations. The Bulldozer architecture features an asymmetric simultaneous multithreading implementation with shared floating point units (FPUs) and per-core arithmetic logic units (ALUs). BulldOver, presented in this paper, improves thread scheduling by exploiting this hardware characteristic to increase performance of floating point-intensive workloads on Linux-based operating systems. BulldOver is a user-space monitoring tool that automatically identifies FPU-intensive threads and schedules them in a more efficient way without requiring any patches or modifications at the kernel level. Our measurements using standard benchmark suites show that speedups of up to 10% can be achieved by simply allowing BulldOver to monitor applications, without any modification of the workload.
引用
收藏
页码:400 / 407
页数:8
相关论文
共 20 条
[1]  
[Anonymous], 2011, P 9 INT C PRINC PRAC
[2]  
[Anonymous], P INT C COMP SEC ICC
[3]  
[Anonymous], 2011, USENIXATC 11
[4]  
[Anonymous], P WORKL CHAR METH CA
[5]  
[Anonymous], P 2 INT WORKSH PERF
[6]  
[Anonymous], 2008, ART MULTIPROCESSOR P
[7]  
Bertran Ramon, 2010, 24th ACM International Conference on Supercomputing 2010, P147
[8]   Contention-Aware Scheduling on Multicore Systems [J].
Blagodurov, Sergey ;
Zhuravlev, Sergey ;
Fedorova, Alexandra .
ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2010, 28 (04)
[9]  
Demme J, 2011, ISCA 2011: PROCEEDINGS OF THE 38TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P353, DOI 10.1145/2024723.2000107
[10]   Performance Profiling of Virtual Machines [J].
Du, Jiaqing ;
Sehrawat, Nipun ;
Zwaenepoel, Willy .
ACM SIGPLAN NOTICES, 2011, 46 (07) :3-14