Chip Multiprocessor Design Space Exploration through Statistical Simulation

被引:29
作者
Genbrugge, Davy [1 ]
Eeckhout, Lieven [1 ]
机构
[1] Univ Ghent, Dept Elect & Informat Syst ELIS, B-9000 Ghent, Belgium
关键词
Performance of systems (modeling techniques; simulation);
D O I
10.1109/TC.2009.77
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Developing fast chip multiprocessor simulation techniques is a challenging problem. Solving this problem is especially valuable for design space exploration purposes during the early stages of the design cycle where a large number of design points need to be evaluated quickly. This paper studies statistical simulation as a fast simulation technique for chip multiprocessor (CMP) design space exploration. The idea of statistical simulation is to measure a number of program execution characteristics from a real program execution through profiling, to generate a synthetic trace from it, and simulate that synthetic trace as a proxy for the original program. The important benefit is that the synthetic trace is much shorter compared to a real program trace, which leads to substantial simulation speedups. This paper enhances state-of-the-art statistical simulation: 1) by modeling the memory address stream behavior in a more microarchitecture-independent way and 2) by modeling a program's time-varying execution behavior. These two enhancements enable accurately modeling resource conflicts in shared resources as observed in the memory hierarchy of contemporary chip multiprocessors when multiple programs are coexecuting on the CMP. Our experimental evaluation using the SPEC CPU benchmarks demonstrates average prediction error of 7.3 percent across a range of CMP configurations while varying the number of cores and memory hierarchy configurations.
引用
收藏
页码:1668 / 1681
页数:14
相关论文
共 31 条
[1]  
[Anonymous], 2007, P 2007 IEEE INT S WO
[2]  
[Anonymous], 2000, ASPLOS 9
[3]   Accelerating multiprocessor simulation with a memory timestamp record [J].
Barr, KC ;
Pan, H ;
Zhang, M ;
Asanovic, K .
ISPASS 2005: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2005, :66-77
[4]   The M5 simulator: Modeling networked systems [J].
Binkert, Nathan L. ;
Dreslinski, Ronald G. ;
Hsu, Lisa R. ;
Lim, Kevin T. ;
Saidi, Ali G. ;
Reinhardt, Steven K. .
IEEE MICRO, 2006, 26 (04) :52-60
[5]  
CARL R, 1998, P WORKSH PERF AN ITS
[6]   Predicting inter-thread cache contention on a chip multi-processor architecture [J].
Chandra, D ;
Guo, F ;
Kim, S ;
Solihin, Y .
11TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, :340-351
[7]   FPGA-accelerated simulation technologies (FAST): Fast, full-system, cycle-accurate simulators [J].
Chiou, Derek ;
Sunwoo, Dam ;
Kim, Joonsoo ;
Patil, Nikhil A. ;
Reinhart, William ;
Johnson, D. Eric ;
Keefe, Jebediah ;
Angepat, Hari .
MICRO-40: PROCEEDINGS OF THE 40TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2007, :249-261
[8]  
Eeckhout L, 2004, CONF PROC INT SYMP C, P350
[9]  
Eeckhout L, 2001, 2001 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, P25, DOI 10.1109/PACT.2001.953285
[10]   Enhancing multiprocessor architecture simulation speed using matched-pair comparison [J].
Ekman, M ;
Stenstrom, P .
ISPASS 2005: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2005, :89-99