CASTED: Core-Adaptive Software Transient Error Detection for Tightly Coupled Cores

被引:3
作者
Mitropoulou, Konstantina [1 ]
Porpodas, Vasileios [1 ]
Cintra, Marcelo [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh EH8 9YL, Midlothian, Scotland
来源
IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013) | 2013年
关键词
adaptation; error detection;
D O I
10.1109/IPDPS.2013.107
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Aggressive silicon process scaling over the last years has made transistors faster and less power consuming. Meanwhile, transistors have become more susceptible to errors. The need to maintain high reliability has led to the development of various software-based error detection methodologies which target either single-core or multi-core processors. In this work, we present CASTED, a Core-Adaptive Software Transient Error Detection methodology that focuses on improving the impact of error detection overhead on single-chip scalable architectures that are composed of tightly coupled cores. The proposed compiler methodology adaptively distributes the error detection overhead to the available resources across multiple cores, fully exploiting the abundant ILP of these architectures. CASTED adapts to a wide range of architecture configurations (issue-width, inter-core delay). We evaluate our technique on a range of architecture configurations using the MediabenchII video and SPEC CINT2000 benchmark suites. Our approach successfully adapts to (and regularly outperforms by up to 21.2%) the best fixed state-of-the-art approach while maintaining the same fault coverage.
引用
收藏
页码:513 / 524
页数:12
相关论文
共 37 条
[1]  
Ando H., 2003, DAC
[2]  
[Anonymous], 2004, DSN
[3]  
[Anonymous], GNU COMP COLL
[4]  
[Anonymous], [No title captured]
[5]  
[Anonymous], 2007, DSN
[6]  
[Anonymous], IBM J RES DEV
[7]  
[Anonymous], 2002, ISCA
[8]  
[Anonymous], IEEE T RELIABILITY
[9]  
[Anonymous], IEEE T COMPUTERS
[10]  
[Anonymous], MICRO