Exploiting procedure level locality to reduce instruction cache misses

被引:0
作者
Batchu, RV [1 ]
Jiménez, DA [1 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08855 USA
来源
EIGHTH WORKSHOP ON INTERACTION BETWEEN COMPILERS AND COMPUTER ARCHITECTURES, PROCEEDINGS | 2004年
关键词
D O I
10.1109/INTERA.2004.1299512
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High instruction fetch bandwidth is essential for high performance in today's wide-issue out-of-order processors. Instruction caches must provide a low miss rate as well as low latency. We introduce Procedure Level Relocation, a class of dynamic feedback-directed optimizations that substantially reduce the instruction cache miss rate by exploiting the temporal locality of procedure usage. Based on the observation that half of all procedures executed are at most 128 bytes in length, we present a Small Procedure Cache, a small and fast explicitly managed memory for storing small procedures. We show that Procedure Level Relocation into a Small Procedure Cache reduces the instruction cache miss rate by an average of 15%.
引用
收藏
页码:75 / 84
页数:10
相关论文
共 50 条
[21]   A low energy cache design for multimedia applications exploiting set access locality [J].
Yang, J ;
Yu, J ;
Zhang, YT .
JOURNAL OF SYSTEMS ARCHITECTURE, 2005, 51 (10-11) :653-664
[22]   Live-cache: Exploiting data redundancy to reduce leakage energy in a cache subsystem [J].
Kabadi, MG ;
Parthasarathi, R .
ADVANCES IN COMPUTER SYSTEMS ARCHITECTURE, 2003, 2823 :337-351
[23]   Exploiting the locality of memory references to reduce the address bus energy [J].
Musoll, E ;
Lang, T ;
Cortadella, J .
1997 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, PROCEEDINGS, 1997, :202-207
[24]   Guest editors' introduction - Cache memory and related problems: Enhancing and exploiting the locality [J].
Milutinovic, V ;
Valero, M .
IEEE TRANSACTIONS ON COMPUTERS, 1999, 48 (02) :97-99
[25]   Exploiting program hotspots and code sequentiality for instruction cache leakage management [J].
Hu, JS ;
Nadgir, A ;
Vijaykrishnan, N ;
Irwin, MJ ;
Kandemir, M .
ISLPED'03: PROCEEDINGS OF THE 2003 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2003, :402-407
[26]   Exploiting a computation reuse cache to reduce energy in network processors [J].
Li, BG ;
Venkatesh, G ;
Calder, B ;
Gupta, R .
HIGH PERFORMANCE EMBEDDED ARCHITECTURES AND COMPLIERS, PROCEEDINGS, 2005, 3793 :251-265
[27]   Dynamic round-robin task scheduling to reduce cache misses for embedded systems [J].
Batcher, Ken W. ;
Walker, Robert A. .
2008 DESIGN, AUTOMATION AND TEST IN EUROPE, VOLS 1-3, 2008, :1332-+
[28]   iCFP: Tolerating All-Level Cache Misses in In-Order Processors [J].
Hilton, Andrew ;
Nagarakatte, Santosh ;
Roth, Amir .
HPCA-15 2009: FIFTEENTH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, PROCEEDINGS, 2009, :431-442
[29]   Energy-Efficient GPU L2 Cache Design Using Instruction-Level Data Locality Similarity [J].
Tan, Jingweijia ;
Yan, Kaige ;
Song, Shuaiwen Leon ;
Fu, Xin .
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2020, 25 (06)
[30]   ICFP: TOLERATING ALL-LEVEL CACHE MISSES IN IN-ORDER PROCESSORS [J].
Hilton, Andrew ;
Nagarakatte, Santosh ;
Roth, Amir .
IEEE MICRO, 2010, 30 (01) :12-19