Balanced instruction cache: Reducing conflict misses of direct-mapped caches through balanced subarray accesses

被引:4
作者
Department of Electrical and Computer Engineering, San Diego State University [1 ]
机构
[1] Department of Electrical and Computer Engineering, San Diego State University
来源
IEEE Comput. Archit. Lett. | 2006年 / 1卷 / 2-5期
关键词
D O I
10.1109/L-CA.2006.5
中图分类号
学科分类号
摘要
It is observed that the limited memory space of direct-mapped caches is not used in balance therefore incurs extra conflict misses. We propose a novel cache organization of a balanced cache, which balances accesses to cache sets at the granularity of cache subarrays. The key technique of the balanced cache is a programmable sitbarray decoder thmugh which the mapping of memory reference addivsses to . cache subarrays can be optimized hence conflict misses of direct-mapped caches can be resolved. The experimental results show that the miss rate of balanced cache is lower than that of the same sized two-way, set-associative caches on average and can be as low as that of the same sized four-way set-associative caches for particular applications. Compared with previous techniques, the balanced cache requires only one cycle to access all cache hits and has the same access time as direct-mapped caches.
引用
收藏
页码:2 / 5
页数:3
相关论文
共 9 条
  • [1] Agarwal A., Pudar S.D., Column-Associative Caches: A Technique for Reducing the Miss Rate of Direct-Mapped Caches, Proc. of the 20th International Symposium on Computer Architecture, pp. 179-180, (1993)
  • [2] Hirose T., Et al., A 20ns 4Mb CMOS SRAM with Hierarchical Word Decoding Architecture, International Solid-State Circuits Conference, (1990)
  • [3] Juan T., Lang T., Navarro J., The Difference-bit Cache, Proceedings of the 27th Annual International Symposium on Computer Architecture, (1996)
  • [4] Johnson T.L., Hwu W.W., Run-time Adaptive Cache Hierarchy Management via Reference Analysis, Proceedings of International Symposium on Computer Architecture, (1997)
  • [5] Jouppi N., Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers, Proceedings of International Symposium on Computer Architecture, (1990)
  • [6] Liu L., Cache Design with Partial Address Matching, Proceedings of the 27th Annual International Symposium on Microarchitecture, (1994)
  • [7] Santhanam S., et. al., A Low-Cost, 300-MHz, RISC CPU with Attached Media Processor, IEEE Journal of Solid-State Circuit, 33, 11, (1998)
  • [8] Peir J.K., Lee Y., Hsu W.W., Capturing Dynamic Memory Reference Behavior with Adaptive Cache Topology, Proc. of the 8th International Conference on Architectural Support for Programming Language and Operating Systems, pp. 540-250, (1998)
  • [9] Yoshimoto M., et.at, A Divided Word-Line Structure in the Static RAM and its Application to a 64k Full CMOS RAM, IEEE J. Solid-State Circuits, SC-21, pp. 479-485, (1983)