Bingo Spatial Data Prefetcher

Cited by: 83
Authors
Bakhshalipour, Mohammad [1, 2]
Shakerinava, Mehran [1]
Lotfi-Kamran, Pejman [2]
Sarbazi-Azad, Hamid [1, 2]
Affiliations
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
Source
2019 25th IEEE International Symposium on High Performance Computer Architecture (HPCA) | 2019
Keywords
Big-Data Applications; Memory System; Data Prefetching; Spatial Correlation; Memory; Cache
DOI
10.1109/HPCA.2019.00053
Chinese Library Classification number
TP3 [Computing Technology, Computer Technology]
Subject classification code
0812
Abstract
Applications extensively use data objects with a regular and fixed layout, which leads to the recurrence of access patterns over memory regions. Spatial data prefetching techniques exploit this phenomenon to prefetch future memory references and hide the long latency of DRAM accesses. While state-of-the-art spatial data prefetchers are effective at reducing the number of data misses, we observe that there is still significant room for improvement. To select an access pattern for prefetching, existing spatial prefetchers associate observed access patterns with either a short event that has a high probability of recurrence or a long event that has a low probability of recurrence. Consequently, the prefetchers either offer low accuracy or lose significant prediction opportunities. We identify that associating the observed spatial patterns with just a single event significantly limits the effectiveness of spatial data prefetchers. In this paper, we make a case for associating the observed spatial patterns with both short and long events to achieve high accuracy without losing prediction opportunities. We propose the Bingo spatial data prefetcher, in which both short and long events are used to select the best access pattern for prefetching. We propose a storage-efficient design for Bingo in which just one history table is needed to maintain the association between the access patterns and the long and short events. Through a detailed evaluation of a set of big-data applications, we show that Bingo improves system performance by 60% over a baseline with no data prefetcher and by 11% over the best-performing prior spatial data prefetcher.
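The single-table idea in the abstract can be illustrated with a minimal sketch. The abstract does not name the two events, so the sketch assumes the long event is the trigger's PC plus block address and the short event is the PC plus the block's offset within its region. The block size, region size, table geometry, index function, and all names (`HistoryTable`, `split`, `insert`, `lookup`) are hypothetical choices for illustration, not the authors' design. The table is indexed by the short event and each entry is tagged with the long event, so one lookup can first try the more accurate long match and then fall back to a short match.

```python
BLOCK_BYTES = 64      # assumed cache-block size
REGION_BLOCKS = 32    # assumed blocks per spatial region (2 KB regions)


def split(addr):
    """Return (block number, offset of that block within its region)."""
    block = addr // BLOCK_BYTES
    return block, block % REGION_BLOCKS


class HistoryTable:
    """One table: the set is chosen by the short event, entries carry the long tag."""

    def __init__(self, num_sets=16, ways=4):
        self.num_sets = num_sets
        self.ways = ways
        # Each set is a list of (long_tag, short_tag, footprint), oldest first.
        self.sets = [[] for _ in range(num_sets)]

    def _locate(self, pc, addr):
        block, offset = split(addr)
        short = (pc, offset)   # assumed short event: PC + offset
        long = (pc, block)     # assumed long event: PC + address
        index = (pc * 31 + offset) % self.num_sets  # hypothetical index hash
        return self.sets[index], short, long

    def insert(self, pc, addr, footprint):
        """Record the footprint (set of block offsets) observed for a region."""
        entries, short, long = self._locate(pc, addr)
        entries[:] = [e for e in entries if e[0] != long]  # drop stale copy
        entries.append((long, short, frozenset(footprint)))
        if len(entries) > self.ways:
            entries.pop(0)  # evict the oldest entry in the set

    def lookup(self, pc, addr):
        """Prefer a long-event match; otherwise use the newest short-event match."""
        entries, short, long = self._locate(pc, addr)
        for tag_long, _, footprint in reversed(entries):
            if tag_long == long:
                return footprint, "long"
        for _, tag_short, footprint in reversed(entries):
            if tag_short == short:
                return footprint, "short"
        return None, "miss"


table = HistoryTable()
table.insert(pc=0x400, addr=0x10000, footprint={0, 1, 5})
print(table.lookup(0x400, 0x10000))  # same PC and address: long-event match
print(table.lookup(0x400, 0x20000))  # same PC and offset, new region: short-event match
```

Because both events share the PC and the offset is derived from the address, every entry that matches the long event also lives in the set selected by the short event, which is what lets a single table serve both lookups in this sketch. Using the newest short-event match is a simplification made here; the sketch does not model how the paper combines multiple matching footprints.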
Pages: 399-411
Page count: 13