An Efficient Temporal Data Prefetcher for L1 Caches

Cited by: 12
Authors
Bakhshalipour, Mohammad [1]
Lotfi-Kamran, Pejman
Sarbazi-Azad, Hamid [1, 2]
Affiliations
[1] Sharif Univ Technol, Dept Comp Engn, Tehran 1458889694, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran 1956836681, Iran
Keywords
Server workloads; L1-D misses; data prefetching; temporal correlation
DOI
10.1109/LCA.2017.2654347
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Server workloads frequently encounter L1-D cache misses, and hence, lose significant performance potential. One way to reduce the number of L1-D misses or their effect is data prefetching. As L1-D access sequences have high temporal correlations, temporal prefetching techniques are promising for L1 caches. State-of-the-art temporal prefetching techniques are effective at reducing the number of L1-D misses, but we observe that there is a significant gap between what they offer and the opportunity. This work aims to improve the effectiveness of temporal prefetching techniques. To overcome the deficiencies of existing temporal prefetchers, we introduce Domino prefetching. Domino prefetcher is a temporal prefetching technique that looks up the history to find the last occurrence of the last one or two L1-D miss addresses for prefetching. We show that Domino prefetcher captures more than 87 percent of the temporal opportunity at L1-D. Through evaluation of a 16-core processor on a set of server workloads, we show that Domino prefetcher improves system performance by 26 percent (up to 56 percent).
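The lookup described in the abstract (find the last occurrence of the last one or two miss addresses in the history, then prefetch what followed) can be illustrated with a small software model. This is only a sketch under stated assumptions, not the paper's hardware design: the class name `TemporalPrefetcherSketch`, the `degree` parameter, the unbounded Python list used as history, and the two dictionaries used as indexes are all hypothetical choices made for illustration.

```python
class TemporalPrefetcherSketch:
    """Toy model of history-based temporal prefetching (illustrative only).

    On each miss it looks for the most recent occurrence of the last two
    miss addresses; if that pair was never seen, it falls back to the most
    recent occurrence of the last single miss address.
    """

    def __init__(self, degree=2):
        self.degree = degree    # assumed: how many addresses to prefetch per miss
        self.history = []       # miss addresses in program order, oldest first
        self.last_single = {}   # addr -> index of its latest occurrence
        self.last_pair = {}     # (prev, addr) -> index of addr in the latest such pair

    def on_miss(self, addr):
        """Record a miss and return the list of addresses to prefetch."""
        prev = self.history[-1] if self.history else None

        # Prefer the two-address match; it disambiguates streams that
        # share a single address but diverge afterwards.
        pos = self.last_pair.get((prev, addr)) if prev is not None else None
        if pos is None:
            pos = self.last_single.get(addr)

        # Replay the addresses that followed the matched occurrence.
        prefetches = []
        if pos is not None:
            prefetches = self.history[pos + 1 : pos + 1 + self.degree]

        # Update the history and both indexes after the lookup.
        idx = len(self.history)
        self.history.append(addr)
        self.last_single[addr] = idx
        if prev is not None:
            self.last_pair[(prev, addr)] = idx
        return prefetches
```

For example, after the miss sequence A B C D X B Y Z, a new miss on A followed by a miss on B matches the pair (A, B) and replays C, D. A lookup on B alone would have matched the later occurrence of B and replayed Y, Z instead, which is the kind of ambiguity the two-address lookup is meant to resolve.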
Pages: 99-102
Number of pages: 4
Related papers
10 in total
[1] Ferdman M., 2011, P 44 ANN IEEE ACM IN, P152
[2] Ferdman M., 2012, ASPLOS XVII: SEVENTEENTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, P37
[3] Jain Akanksha, 2013, 2013 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). Proceedings, P247, DOI 10.1145/2540708.2540730
[4] Jevdjic D., 2013, P 40 ANN INT S COMP, P223
[5] Lotfi-Kamran P., 2012, CONF PROC INT SYMP C, P500, DOI 10.1109/ISCA.2012.6237043
[6] Nevill-Manning, CG; Witten, IH. Identifying hierarchical structure in sequences: A linear-time algorithm [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1997, 7: 67-82
[7] Shevgoor M., 2015, P 48 ANN IEEE ACM IN, P141
[8] Wenisch T. F., 2007, THESIS
[9] Wenisch, Thomas F.; Wunderlich, Roland E.; Ferdman, Michael; Ailamaki, Anastassia; Falsafi, Babak; Hoe, James C. SimFlex: Statistical sampling of computer system simulation [J]. IEEE MICRO, 2006, 26 (04): 18-31
[10] Wenisch T. F., 2009, INT S HIGH PERF COMP, P79, DOI 10.1109/HPCA.2009.4798239