Reliability Assessment of Low-Power Processors in Supercomputing Systems

被引:0
作者
Shin, Insun [1 ]
Kwon, Daeil [2 ]
机构
[1] UNIST, Dept Syst Design & Control Engn, Ulsan 44919, South Korea
[2] UNIST, Sch Mech Aerosp & Nucl Engn, Ulsan 44919, South Korea
基金
新加坡国家研究基金会;
关键词
Failure Modes; Mechanisms; and Effects Analysis; Lifetime Prediction; Low-Power Processors; Physics of Failure; Supercomputing Systems; JOINT;
D O I
10.1166/nnl.2017.2474
中图分类号
TB3 [工程材料学];
学科分类号
0805 ; 080502 ;
摘要
Low-power processors have emerged as an alternative for supercomputers and cloud computers to reduce their energy consumption. Many low-power processors are designed and used for mobile devices, such as phones, tablets, and laptops, and their reliability is commonly evaluated based on mobile use conditions. Supercomputers are, on the other hand, often used for high-performance computing, including complex calculations, graphics processing, and large-scale modeling and simulations, which generates heavy workload and stress conditions. Therefore, the reliability of low-power processors in supercomputing systems can be of concern. This paper assessed the lifetime of low-power processors used in high-performance computing environments. Failure modes, mechanisms, and effects analysis identified thermo-mechanical fatigue at board level solder joints as a critical failure mechanism of low-power processors. An HPC use condition was estimated based on the supercomputer use statistics collected from a supercomputing center. Experimental studies were conducted to correlate the use conditions with changes in the package temperature. A physics-of- failure-based reliability model was used to estimate the lifetime of low-power processors under multiple load ratios with or without active cooling. Recommendations for reliable operation of low-power processor-based supercomputers were presented.
引用
收藏
页码:1241 / 1245
页数:5
相关论文
共 24 条
  • [1] [Anonymous], 2013, B110B JESD22 JEDEC
  • [2] [Anonymous], 2013, A100D JESD22 JEDEC
  • [3] [Anonymous], 2015, A110E JESD22 JEDEC
  • [4] [Anonymous], 2013, B113A JESD22 JEDEC
  • [5] [Anonymous], 2016, A106B JESD22 JEDEC
  • [6] [Anonymous], 2004, A105C JESD22 JEDEC
  • [7] [Anonymous], 2015, A101D JESD22 JEDEC
  • [8] [Anonymous], 2014, A104E JESD22 JEDEC
  • [9] [Anonymous], 2016, A122A JESD22 JEDEC
  • [10] [Anonymous], 2015, A103E JESD22 JEDEC