Reliability-Aware Test Methodology for Detecting Short-Channel Faults in On-Chip Networks

被引:10
作者
Bhowmik, Biswajit [1 ]
Biswas, Santosh [1 ]
Deka, Jatindra Kumar [1 ]
Bhattacharya, Bhargab B. [2 ]
机构
[1] IIT Guwahati, Dept Comp Sci & Engn, Gauhati 781039, India
[2] Indian Stat Inst Kolkata, Adv Comp & Microelect Unit, Kolkata 700108, India
关键词
Coverage metrics; fault modeling and testing; intra and interchannel shorts; network-on-chip (NoC); performance metrics; system-level failures; NOC; PERFORMANCE; INTERCONNECTS;
D O I
10.1109/TVLSI.2018.2803478
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the advent of rapidly evolving nanoelectronic systems, compact implementation of versatile and dense network-on-chips (NoCs) on a die has emerged as technology-of-choice for multicore computing. However, because of the increased density, NoCs often suffer from various types of manufacturing faults, which degrade the yield and jeopardize the reliability of the overall system. For example, short-channel faults in an NoC often cause system-level failures that may have significant impact on its performance. This paper proposes a cluster-based distributed scheme for online testing of short faults in NoC channels. The proposed algorithm detects both intra and interchannel short faults and identifies the underlying faulty channel-wires connected to a node. The nodes in a cluster-set are appropriately scheduled to reduce test time. The approach scales to larger NoCs irrespective of size of the network and channel width. The proposed scheme also extends its application from regular to irregular NoCs, and to other channel faults like open. Fault simulation shows that the proposed cluster-driven scheme is capable of detecting all modeled short-channel faults. Online evaluation of the scheme also reveals the extent of impact that the short faults impart on various performance metrics for large traffic. Compared to prior work, it reduces hardware area overhead up to 27% and test time by more than 21x on several test cases. In addition, packet latency and energy consumption are reduced by 19.47%-40.16% and 17.57%-34.20%, respectively.
引用
收藏
页码:1026 / 1039
页数:14
相关论文
共 38 条
[1]   Post-Silicon Platform for the Functional Diagnosis and Debug of Networks-on-Chip [J].
Abdel-Khalek, Rawan ;
Bertacco, Valeria .
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
[2]  
Abed S., 2017, P 6 INT C SOFTW COMP, P233
[3]  
[Anonymous], 2014, 2014 IEEE 32 VLSI TE
[4]   Xpipes: A network-on-chip architecture for gigascale systems-on-chip [J].
Bertozzi, Davide ;
Benini, Luca .
IEEE Circuits and Systems Magazine, 2004, 4 (02) :18-31
[5]  
Bhowmik B., 2015, P IEEE 12 IND C INDI, P1
[6]  
Bhowmik B., 2016, P IEEE 22 NAT C COMM, P1, DOI 10.1109/NCC.2016.7561197
[7]  
Bhowmik B, 2016, IEEE SYS MAN CYBERN, P4561, DOI 10.1109/SMC.2016.7844950
[8]  
Bhowmik B, 2016, PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS
[9]  
IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY
[10]  
IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), P530, DOI [10.1109/HPCC-SmartCity-DSS.2016.5, 10.1109/HPCC-SmartCity-DSS.2016.0081]