Using dynamic binary translation to fuse dependent instructions

Cited by: 0
Authors
Hu, SL [1]
Smith, JE [1]
Affiliations
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
Source
CGO 2004: INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION | 2004
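Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract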
Instruction scheduling hardware can be simplified and easily pipelined if pairs of dependent instructions are fused so they share a single instruction scheduling slot. We study an implementation of the x86 ISA that dynamically translates x86 code to an underlying ISA that supports instruction fusing. A microarchitecture that is co-designed with the fused instruction set completes the implementation. In this paper, we focus on the dynamic binary translator for such a co-designed x86 virtual machine. The dynamic binary translator first cracks x86 instructions belonging to hot superblocks into RISC-style micro-operations, and then uses heuristics to fuse together pairs of dependent micro-operations. Experimental results with SPEC2000 integer benchmarks demonstrate that: (1) the fused ISA with dynamic binary translation reduces the number of scheduling decisions by about 30% versus a conventional implementation that uses hardware cracking into RISC micro-operations; (2) an instruction scheduling slot need only hold two source register fields even though it may hold two instructions; (3) translations generated in the proposed ISA consume about 30% less storage than a corresponding fixed-length RISC-style ISA.
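The fusing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' heuristic, which the abstract does not specify. It assumes a hypothetical MicroOp record, considers only adjacent micro-operations, and treats the two-source-field observation from result (2) as a hard limit (MAX_SLOT_SOURCES). All names here are illustrative.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Assumption: a scheduling slot carries at most two external source registers.
MAX_SLOT_SOURCES = 2


@dataclass(frozen=True)
class MicroOp:
    """Hypothetical RISC-style micro-operation produced by cracking."""
    op: str
    dest: Optional[str]      # destination register, None for stores/branches
    srcs: Tuple[str, ...]    # source registers


def fuse_adjacent(uops: List[MicroOp]) -> List[Tuple[MicroOp, ...]]:
    """Greedily pair each micro-op with the next one if the next one
    consumes its result and the pair needs few enough external sources.
    Returns a list of scheduling slots holding one or two micro-ops."""
    slots: List[Tuple[MicroOp, ...]] = []
    i = 0
    while i < len(uops):
        head = uops[i]
        if i + 1 < len(uops):
            tail = uops[i + 1]
            if head.dest is not None and head.dest in tail.srcs:
                # The head's result is forwarded inside the pair, so it
                # does not count as an external source of the slot.
                external = set(head.srcs) | (set(tail.srcs) - {head.dest})
                if len(external) <= MAX_SLOT_SOURCES:
                    slots.append((head, tail))
                    i += 2
                    continue
        slots.append((head,))
        i += 1
    return slots


# Illustrative input: `add eax, [ebx]` cracked into a load plus an add,
# followed by two unrelated micro-ops.
uops = [
    MicroOp("ld", "t0", ("ebx",)),
    MicroOp("add", "eax", ("eax", "t0")),
    MicroOp("sub", "ecx", ("ecx", "edx")),
    MicroOp("st", None, ("esi", "edi")),
]
slots = fuse_adjacent(uops)
print(f"{len(uops)} micro-ops -> {len(slots)} scheduling slots")
```

In this toy input the load and the dependent add share one slot, so four micro-operations need three scheduling decisions. A real translator would presumably also look past adjacent micro-operations and weigh criticality, which this sketch leaves out.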
Pages: 213-224
Page count: 12