Silent Data Corruption - Myth or Reality?

被引:12
作者
Constantinescu, Cristian
Parulkar, Ishwar
Harper, Rick
Michalak, Sarah
机构
来源
2008 IEEE INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS & NETWORKS WITH FTCS & DCC | 2008年
关键词
D O I
10.1109/DSN.2008.4630077
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The higher complexity of the hardware and software employed by modern computing systems, as well as semiconductor technology scaling, are increasing the likelihood of Silent Data Corruption (SDC). SDC occurs when incorrect data is provided to the user, e.g., written to the memory or I/O system, and no error is triggered. Such events may have catastrophic effects, in the case of life critical applications, and represent a significant cost penalty for businesses. The purpose of this panel is to provide real examples of silent corruption, and discuss solutions for avoiding it. The presentations address SDC generated at the semiconductor device level, as well as the virtualization software level. Techniques for reducing SDC, from the circuit to system level, will be covered. Results of an extensive SDC study, carried out at Los Alamos National Laboratory (LANL) on high-performance computing (HPC) platforms are also given.
引用
收藏
页码:108 / 109
页数:2
相关论文
empty
未找到相关数据