Self-adaptive data caches for soft-error reliability

被引:10
作者
Wang, Shuai [1 ]
Hu, Jie [1 ]
Ziavras, Sotirios G. [1 ]
机构
[1] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
关键词
data cache; reliability; reliable system design; self adaptation; soft error;
D O I
10.1109/TCAD.2008.925789
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Soft-error induced reliability problems have become a major challenge in designing new generation microprocessors. Due to the on-chip caches' dominant share in die area and transistor budget, protecting them against soft errors is of paramount importance. Recent research has focused on the design of cost-effective reliable data caches in terms of performance, energy, and area overheads, based on the assumption of fixed error rates. However, for systems in operating environments that vary with time or location, those schemes will be either insufficient or overdesigned for the changing error rates. In this paper, we explore the design of a self-adaptive reliable data cache that dynamically adapts its employed reliability schemes to the changing operating environments thus to maintain a target reliability. The proposed data cache is implemented with three levels of error protection schemes, a monitoring mechanism, and a control component that decides whether to upgrade, downgrade, or keep the current protection level based on the feedback from the monitor. Our experimental evaluation using a set of SPEC CPU2000 benchmarks shows that our self-adaptive data cache achieves similar reliability to a cache protected by the most reliable scheme, while simultaneously minimizing the performance and power overheads.
引用
收藏
页码:1503 / 1507
页数:5
相关论文
共 23 条
  • [1] [Anonymous], 2006, P 1 INT C NANONETWOR
  • [2] [Anonymous], 2002, ASPLOS
  • [3] Balancing performance and reliability in the memory hierarchy[J]. Asadi, GH;Sridharan, V;Tahoori, MB;Kaeli, D. ISPASS 2005: IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2005
  • [4] Radiation-induced soft errors in advanced semiconductor technologies[J]. Baumann, RC. IEEE TRANSACTIONS ON DEVICE AND MATERIALS RELIABILITY, 2005(03)
  • [5] Computing architectural vulnerability factors for address-based structures[J]. Biswas, A;Racunas, P;Cheveresan, R;Emer, J;Mukherjee, SS;Rangan, R. 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2005
  • [6] Brooks D, 2000, PROCEEDING OF THE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, P83, DOI 10.1109/ISCA.2000.854380
  • [7] Burger D, 1997, 1342 U WISC MAD COMP
  • [8] Kaxiras S, 2001, ACM COMP AR, P240, DOI 10.1109/ISCA.2001.937453
  • [9] The alpha 21264 microprocessor[J]. Kessler, RE. IEEE MICRO, 1999(02)
  • [10] Kim S, 1999, CONF PROC INT SYMP C, P246