An intelligent cache system with hardware prefetching for high performance

被引:7
作者
Lee, JH [1 ]
Jeong, SW
Kim, SD
Weems, CC
机构
[1] Yonsei Univ, Dept Comp Sci, Parallel Proc Lab, Seoul 120749, South Korea
[2] Samsung Elect Co Ltd, Syst LSI Business, Media SOC Team, Kihueng, South Korea
[3] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
基金
美国国家科学基金会;
关键词
memory hierarchy; dual data cache; temporal locality; spatial locality; prefetching;
D O I
10.1109/TC.2003.1197127
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a high performance cache structure with a hardware prefetching mechanism that enhances exploitation of spatial and temporal locality. The proposed cache, which we call a Selective-Mode Intelligent (SMI) cache, consists of three parts: a direct-mapped cache with a small block size, a fully associative spatial buffer with a large block size, and a hardware prefetching unit. Temporal locality is exploited by selectively moving small blocks into the direct-mapped cache after monitoring their activity in the spatial buffer for a time period. Spatial locality is enhanced by intelligently prefetching a neighboring block when a spatial buffer hit occurs. The overhead of this prefetching operation is shown to be negligible. We also show that the prefetch operation is highly accurate: Over 90 percent of all prefetches generated are for blocks that are subsequently accessed. Our results show that the system enables the cache size to be reduced by a factor of four to eight relative to a conventional direct-mapped cache while maintaining similar performance. Also, the SMI cache can reduce the miss ratio by around 20 percent and the average memory access time by 10 percent, compared with a victim-buffer cache configuration.
引用
收藏
页码:607 / 616
页数:10
相关论文
共 17 条
[1]   Power/performance advantages of victim buffer in high-performance processors [J].
Albera, G ;
Bahar, RI .
IEEE ALESSANDRO VOLTA MEMORIAL WORKSHOP ON LOW-POWER DESIGN, PROCEEDINGS, 1999, :43-51
[2]  
[Anonymous], P INT C SUP
[3]  
[Anonymous], 1991, P ACM IEEE C SUP SUP
[4]   OPTIMALLY PROFILING AND TRACING PROGRAMS [J].
BALL, T ;
LARUS, JR .
ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1994, 16 (04) :1319-1360
[5]  
CHEN WY, 1992, P 25 INT S MICR, P92
[6]  
CHEN WY, 1991, P 24 ANN WORKSH MICR
[7]  
Jouppi N. P., 1990, Proceedings. The 17th Annual International Symposium on Computer Architecture (Cat. No.90CH2887-8), P364, DOI 10.1109/ISCA.1990.134547
[8]  
KURPANCHEK G, 1994, COMPCON FEB, P375
[9]   A new cache architecture based on temporal and spatial locality [J].
Lee, JH ;
Lee, JS ;
Kim, SD .
JOURNAL OF SYSTEMS ARCHITECTURE, 2000, 46 (15) :1451-1467
[10]  
MILUTINOVIC V, 1996, P 5 SCIZZI C MAR