Exploiting procedure level locality to reduce instruction cache misses

Cited by: 0
Authors
Batchu, RV [1]
Jiménez, DA [1]
Affiliation
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08855 USA
Source
EIGHTH WORKSHOP ON INTERACTION BETWEEN COMPILERS AND COMPUTER ARCHITECTURES, PROCEEDINGS | 2004
DOI
10.1109/INTERA.2004.1299512
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
High instruction fetch bandwidth is essential for high performance in today's wide-issue out-of-order processors. Instruction caches must provide a low miss rate as well as low latency. We introduce Procedure Level Relocation, a class of dynamic feedback-directed optimizations that substantially reduce the instruction cache miss rate by exploiting the temporal locality of procedure usage. Based on the observation that half of all procedures executed are at most 128 bytes in length, we present a Small Procedure Cache, a small and fast explicitly managed memory for storing small procedures. We show that Procedure Level Relocation into a Small Procedure Cache reduces the instruction cache miss rate by an average of 15%.
Pages: 75-84
Page count: 10