Improving Yield and Reliability of Chip Multiprocessors

被引:0
作者
Pan, Abhisek [1 ]
Khan, Omer [1 ]
Kundu, Sandip [1 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
来源
DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3 | 2009年
关键词
yield; reliability; micorarchitecture; multiprocessors;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An increasing number of hardware failures can be attributed to device reliability problems that cause partial system failure or shutdown. In this paper we propose a scheme for improving reliability of a homogeneous chip multiprocessor (CMP) that also serves to improve manufacturing yield. Our solution centers on exploiting the natural redundancy that already exists in multi-core systems by using services from other cores for functional units that are defective in a faulty core. A micro-architectural modification allows a core on a CMP to use another core as a coprocessor to service any instruction that the former cannot execute correctly. This service is accessed to improve yield and reliability, but at the cost of some loss of performance. In order to quantify this loss we have used a cycle-accurate simulator to simulate the performance of a dual-core system with one or two cores sustaining partial failure. Our results indicate that when a large and sparingly-used unit such as a floating point arithmetic unit fails in a core, even for a floating point intensive benchmark, we can continue to run each faulty core with help from companion cores with as little as 10% impact to performance and less than 1% area overhead.
引用
收藏
页码:490 / 495
页数:6
相关论文
共 50 条
[31]   Synergistic Reliability and Yield Enhancement Techniques for Embedded SRAMs [J].
Lu, Shyue-Kung ;
Huang, Huan-Hua ;
Huang, Jiun-Lang ;
Ning, Pony .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2013, 32 (01) :165-169
[32]   On the Reliability of FeFET On-Chip Memory [J].
Genssler, Paul R. ;
van Santen, Victor M. ;
Henkel, Joerg ;
Amrouch, Hussam .
IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (04) :947-958
[33]   Adaptive ECC Techniques for Yield and Reliability Enhancement of Flash Memories [J].
Lu, Shyue-Kung ;
Zhong, Shang-Xiu ;
Hashizume, Masaki .
2016 IEEE 25TH ASIAN TEST SYMPOSIUM (ATS), 2016, :287-292
[34]   Dynamic Transfer of Computation to Processor Cache for Yield and Reliability Improvement [J].
Paul, Somnath ;
Bhunia, Swarup .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2011, 19 (08) :1368-1379
[35]   Reliability assessment of delamination in chip-to-chip bonded MEMS packaging [J].
Swaminathan, R ;
Bhaskaran, H ;
Sandborn, PA ;
Subramanian, G ;
Deeds, MA ;
Cochran, KR .
IEEE TRANSACTIONS ON ADVANCED PACKAGING, 2003, 26 (02) :141-151
[36]   IMPROVING SUBSTATION RELIABILITY AND AVAILABILITY [J].
Spiewak, Robert M. ;
Pieniazek, Dominik ;
Pittman, Jerry ;
Weisse, Floyd ;
Wilson, Daniel .
INDUSTRY APPLICATIONS SOCIETY 56TH ANNUAL PETROLEUM AND CHEMICAL INDUSTRY CONFERENCE, 2009, :351-+
[37]   Improving the Reliability of Semiconductor Converters [J].
Korzhavin M.E. ;
Zhuravlev A.M. ;
Grigorev M.A. .
Russian Electrical Engineering, 2020, 91 (07) :457-460
[38]   On Graceful Degradation of Chip Multiprocessors in Presence of Faults via Flexible Pooling of Critical Execution Units [J].
Rodrigues, Rance ;
Kundu, Sandip .
2011 IEEE 17TH INTERNATIONAL ON-LINE TESTING SYMPOSIUM (IOLTS), 2011,
[39]   Fault Leveling Techniques for Yield and Reliability Enhancement of NAND Flash Memories [J].
Lu, Shyue-Kung ;
Zhong, Shang-Xiu ;
Hashizume, Masaki .
JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 2018, 34 (05) :559-570
[40]   Effects of Bonding Parameters on the drop impact reliability of microbumps in chip on chip interconnection [J].
Luo, Honglong ;
Li, Ganglong ;
Su, Qi ;
Zhu, Wenhui ;
Chen, Zhuo .
2017 18TH INTERNATIONAL CONFERENCE ON ELECTRONIC PACKAGING TECHNOLOGY (ICEPT), 2017, :1376-1380