Design of Fault-Tolerant and Reliable Networks-on-Chip

被引:8
作者
Wang, Junshi [1 ]
Ebrahimi, Masoumeh [2 ,3 ]
Huang, Letian [1 ]
Jantsch, Axel [4 ]
Li, Guangjun [1 ]
机构
[1] UESTC, Chengdu, Peoples R China
[2] KTH, Stockholm, Sweden
[3] Univ Turku, SF-20500 Turku, Finland
[4] TU Wien, Vienna, Austria
来源
2015 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI | 2015年
关键词
Fault-tolerant design flow; Networks-on-Chip; Fault modelling; ARCHITECTURE; LOGIC;
D O I
10.1109/ISVLSI.2015.33
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Networks-on-Chips (NoCs) are at the core of high performance multi-processor systems-on-chips. As the number of cores and sub-systems on chip grows, the size and complexity of NoCs increase as well. Due to the process variation, aging effects and soft-errors in current and expected future process generations, the probability of failure in the NoCs rises and has to be fought at all levels: circuit, architecture, and communication protocols. This paper discusses appropriate fault models for NoCs and their effects on the architecture and network levels. A method to design fault-tolerant NoCs comprising of techniques at the link level, the routing level, and the end-to-end level of the communication is presented. In addition, the proposed method offers an isolation technique where the computing cores are decoupled from the faults in the network. This technique avoids or at least attenuates the severe impacts of faults on the network performance and functionality. These point techniques are combined together to design fault-tolerant and reliable NoCs.
引用
收藏
页码:545 / 550
页数:6
相关论文
共 50 条
[31]   A Survey on Design Approaches to Circumvent Permanent Faults in Networks-on-Chip [J].
Werner, Sebastian ;
Navaridas, Javier ;
Lujan, Mikel .
ACM COMPUTING SURVEYS, 2016, 48 (04)
[32]   Graceful Fault-tolerant On-chip Spike Routing Algorithm for Mesh-based Spiking Neural Networks [J].
Vu, The H. ;
Murakami, Yuji ;
Ben Abdallah, Abderazek .
2019 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2019), 2019, :76-80
[33]   On the Capacity of Bufferless Networks-on-Chip [J].
Shpiner, Alexander ;
Kantor, Erez ;
Li, Pu ;
Cidon, Israel ;
Keslassy, Isaac .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (02) :492-506
[34]   Statistical Approach to Networks-on-Chip [J].
Cohen, Itamar ;
Rottenstreich, Ori ;
Keslassy, Isaac .
IEEE TRANSACTIONS ON COMPUTERS, 2010, 59 (06) :748-761
[35]   Fault-Tolerant Communication in HSDC: Ensuring Reliable Data Transmission in Smart Cities [J].
Dong, Hui ;
Lv, Mengjie ;
Fan, Weibei .
IEEE TRANSACTIONS ON RELIABILITY, 2024, 73 (04) :1933-1945
[36]   Fault Localizing End-to-End Flow Control Protocol for Networks-on-Chip [J].
Schley, Gert ;
Batzolis, Nikolaos ;
Radetzki, Martin .
PROCEEDINGS OF THE 2013 21ST EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING, 2013, :454-461
[37]   Fault-tolerant Routing for On-chip Network Without Using Virtual Channels [J].
Ren, Pengju ;
Meng, Qingxin ;
Ren, Xiaowei ;
Zheng, Nanning .
2014 51ST ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2014,
[38]   FAC: A Fault-Tolerant Design Approach Based on Approximate Computing [J].
Balasubramanian, Padmanabhan ;
Maskell, Douglas L. .
ELECTRONICS, 2023, 12 (18)
[39]   A systematic approach towards fault-tolerant design of QCA circuits [J].
Kumar, Dharmendra ;
Mitra, Debasis .
ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2019, 98 (03) :501-515
[40]   \ Trace-Driven Optimization of Networks-on-Chip Configurations [J].
Kahng, Andrew B. ;
Lin, Bill ;
Samadi, Kambiz ;
Ramanujam, Rohit Sunkam .
PROCEEDINGS OF THE 47TH DESIGN AUTOMATION CONFERENCE, 2010, :437-442