Per-Thread Cycle Accounting in SMT Processors

Cited by: 28
Authors
Eyerman, Stijn [1]
Eeckhout, Lieven [1]
Affiliations
[1] Univ Ghent, ELIS Dept, Ghent, Belgium
Keywords
Design; Experimentation; Performance; Simultaneous Multithreading (SMT); Cycle accounting; Thread-progress aware fetch policy
DOI
10.1145/1508284.1508260
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
This paper proposes a cycle accounting architecture for Simultaneous Multithreading (SMT) processors that, while threads run simultaneously on the SMT processor, estimates the execution time each thread would have had if it had run alone. It does so by attributing each cycle during multi-threaded execution to one of three components: base, miss event, or waiting. The single-threaded (alone) execution time is then estimated as the sum of the base and miss event components; the waiting component represents the cycles lost due to SMT execution. The architecture incurs a reasonable hardware cost (around 1KB of storage) and estimates single-threaded performance with average prediction errors of around 7.2% for two-program workloads and 11.7% for four-program workloads. It has several important applications to system software and its interaction with SMT hardware. First, the estimated alone execution time gives system software an accurate picture of the processor cycles each thread actually consumed; using the alone execution time instead of the total execution time (timeslice) may make system software scheduling policies more effective. Second, a new class of thread-progress aware SMT fetch policies, based on per-thread progress indicators, enables system-software-level priorities to be enforced at the hardware level.
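The accounting identity described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's hardware design: the class name, field names, and counter values are hypothetical, and the `progress` ratio is only one plausible reading of a "per-thread progress indicator", which the abstract does not define.

```python
from dataclasses import dataclass


@dataclass
class ThreadCycleCounters:
    """Hypothetical per-thread counters; names are illustrative only."""
    base: int        # cycles attributed to the base component
    miss_event: int  # cycles attributed to the miss event component
    waiting: int     # cycles lost due to SMT execution

    @property
    def smt_cycles(self) -> int:
        # Every cycle is attributed to exactly one of the three components.
        return self.base + self.miss_event + self.waiting

    @property
    def estimated_alone_cycles(self) -> int:
        # Per the abstract: alone time = base + miss event components.
        return self.base + self.miss_event

    @property
    def progress(self) -> float:
        # Assumption: ratio of estimated alone cycles to SMT cycles.
        total = self.smt_cycles
        return self.estimated_alone_cycles / total if total else 0.0


# Made-up counter values for one thread over an interval of SMT execution.
t0 = ThreadCycleCounters(base=600, miss_event=250, waiting=150)
print(t0.smt_cycles, t0.estimated_alone_cycles, t0.progress)
```

Under these assumptions, system software could compare `estimated_alone_cycles` against the timeslice length (`smt_cycles`) to see how much of the slice a thread effectively lost to sharing the core.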
Pages: 133-144
Page count: 12