Designs of the Basic Block Reassembling Instruction Stream Buffer for X86 ISA

被引:0
作者
Chiu, Jih-Ching [1 ]
Chou, Yu-Liang [1 ]
Yeh, Ta-Li [1 ]
Lin, Tseng-Kuei [1 ]
机构
[1] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 804, Taiwan
来源
2008 13th Asia-Pacific Computer Systems Architecture Conference | 2008年
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The potential performance of superscalar processors can be exploited only when processor is fed with sufficient instruction bandwidth. The front-end units, the Instruction Stream Buffer (ISB) and the fetcher, are the key elements for achieving this goal. Current ISBs cannot support instruction streaming beyond a basic block. In x86 processors, the split-line instruction problem worsens this situation. We proposed a basic blocks reassembling ISB in this paper. By cooperating with the proposed Line Weighted Branch Target Buffer (LWBTB), the ISB can predict advance branch information and reassemble cache lines. Front-End could fetch more valid instructions in a cycle by reassembling the original line containing instructions for the next basic block. Simulation results show that the cache line size over 64 bytes has a good chance to let two basic blocks in the reassembled instruction line and the fetch efficiency is about 90% as the fetch capacity is under 6.
引用
收藏
页码:60 / 67
页数:8
相关论文
共 10 条
[1]  
*AMD CORP, 2004, SOFTW OPT GUID AMD A
[2]  
CHIU JC, 2000, P IEEE INT C COMP DE
[3]  
CHIU JC, 2001, IEE P-COMPUT DIG T, P113
[4]  
CHIU JC, 1998, P INT C COMP SYST TE, P31
[5]  
MCFARLIN S, 1993, TN36 WRL DIG EQ CORP
[6]   Evaluation of design options for the trace cache fetch mechanism [J].
Patel, SJ ;
Friendly, DH ;
Patt, YN .
IEEE TRANSACTIONS ON COMPUTERS, 1999, 48 (02) :193-204
[7]   Trace cache: A low latency approach to high bandwidth instruction fetching [J].
Rotenberg, E ;
Bennett, S ;
Smith, JE .
PROCEEDINGS OF THE 29TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE - MICRO-29, 1996, :24-34
[8]   Modeled and measured instruction fetching performance for superscalar microprocessors [J].
Wallace, S ;
Bagherzadeh, N .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1998, 9 (06) :570-578
[9]  
WILTON SJE, 1994, 935 WRL DIG EQ CORP
[10]  
Yeh T.-Y., 1992, P 19 ANN INT S COMP, DOI [10.1145/139669.139709, DOI 10.1145/139669.139709]