Accurately Approximating Superscalar Processor Performance from Traces

被引:0
作者
Lee, Kiyeon [1 ]
Evans, Shayne [1 ]
Cho, Sangyeun [1 ]
机构
[1] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
来源
ISPASS 2009: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE | 2009年
关键词
SIMULATION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Trace-driven simulation of superscalar processors is particularly complicated. The dynamic nature of superscalar processors combined with the static nature of traces can lead to large inaccuracies in the results, especially when traces contain only a subset of executed instructions for trace reduction. The main problem in the filtered trace simulation is that the trace does not contain enough information with which one can predict the actual penalty of a cache miss. In this paper, we discuss and evaluate three strategies to quantify the impact of a long latency memory access in a superscalar processor when traces have only L1 cache misses. The strategies are based on models about how a cache miss is treated with respect to other cache misses: (1) isolated cache miss model, (2) independent cache miss model, and (3) pairwise dependent cache miss model. Our experimental results demonstrate that the pairwise dependent cache miss model produces reasonably accurate results (4.8% RMS error) under perfect branch prediction. Our work forms a basis for fast, accurate, and configurable multicore processor simulation using a pre-determined processor core design.
引用
收藏
页码:238 / 248
页数:11
相关论文
共 25 条
[1]   SimpleScalar: An infrastructure for computer system modeling [J].
Austin, T ;
Larson, E ;
Ernst, D .
COMPUTER, 2002, 35 (02) :59-+
[2]  
Barnes L., 2007, INT S PERF AN SYST S
[3]   Can trace-driven simulators accurately predict superscalar performance? [J].
Black, B ;
Huang, AS ;
Lipasti, MH ;
Shen, JP .
INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1996, :478-485
[4]  
BORKAR S, 2005, TECHNOLOGY INTEL MAR
[5]  
CHAME J, 1993, P ACM SIGMETRICS C M, P248
[6]  
Chen X., 2008, P 11 INT S MAGN BEAR, P455
[7]   Representative traces for processor models with infinite cache [J].
Iyengar, VS ;
Trevillyan, LH ;
Bose, P .
SECOND INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 1996, :62-72
[8]  
Johnson Mike, 1991, Superscalar Microprocessor Design
[9]  
Kaplan S. F., 2003, ACM Transactions on Modeling and Computer Simulation, V13, P1, DOI 10.1145/778553.778554
[10]  
Karkhanis TS, 2004, CONF PROC INT SYMP C, P338