Load balancing-oriented fault-tolerant NoC design

被引:0
|
作者
Tan, Ta
Chen, Xiaowen [1 ]
Li, Chen
Lu, Jianzhuang
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha, Peoples R China
来源
8TH INTERNATIONAL TEST CONFERENCE IN ASIA, ITC-ASIA 2024 | 2024年
关键词
NoC; fault-tolerant; reliability; load balance-oriented;
D O I
10.1109/ITC-Asia62534.2024.10661350
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Network-on-Chip (NoC) has been widely applied in modern chip multiprocessors due to its high bandwidth and scalability. However, as technology advances to the nanometer scale, NoC is increasingly vulnerable to errors caused by crosstalk, radiation, electromagnetic interference, etc. Conventional Switch-to-Switch (S2S) fault-tolerant designs based on ECC have overlooked the characteristic of the distribution of traffic load. This oversight not only increases area overhead significantly but also leads to low average utilization of ECC decoder modules. In this paper, we analyze the distribution of traffic load in mesh network and propose a load balancing-oriented fault-tolerant NoC design. The core idea is to allocate different numbers of ECC decoder modules to each router based on the distribution of traffic load, aiming to improve the average utilization of ECC decoder modules and reduce the area overhead without compromising fault-tolerant capability of NoC. The experiment under 6 common synthetic traffic patterns shows that compared to the baseline, our design exhibits an average delay performance loss of less than 0.88%. Additionally, the maximum reduction in the number of ECC decoder modules is 160, the maximum reduction in the area overhead of NoC is 15.06%, and the maximum improvement in the average utilization of ECC decoder modules is 1.21x. Furthermore, the experiment under PARSEC benchmarks shows that compared to the baseline, our design exhibits an average delay performance loss of less than 0.08%. Additionally, the maximum reduction in the number of ECC decoder modules is 156, the maximum reduction in total NoC area overhead is 14.69%, and the maximum improvement in the average utilization of ECC decoder modules is 1.13x.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Fault-Tolerant Routing With Load Balancing in LeTQ Networks
    Fan, Weibei
    Xiao, Fu
    Fan, Jianxi
    Han, Zhijie
    Sun, Lijuan
    Wang, Ruchuan
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (01) : 68 - 82
  • [2] Research on Fault-Tolerant Routing Mechanism of NoC
    Hou, Guowei
    Yu, Lixin
    Song, Liguo
    Peng, Heping
    Zhuang, Wei
    PROCEEDINGS OF THE 2016 3RD INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING, MANUFACTURING TECHNOLOGY AND CONTROL, 2016, 67 : 1657 - 1663
  • [3] Review on Fault-Tolerant NoC Designs
    Jun-Shi Wang
    Le-Tian Huang
    Journal of Electronic Science and Technology, 2018, 16 (03) : 191 - 221
  • [4] The Fault-Tolerant NoC Techniques with FPGA
    Lu, Zhi
    Jiang, Shu Yan
    Huang, Le Tian
    Wu, Chao
    Luo, Gang
    Li, Qi
    Song, Guo Ming
    2015 IEEE International Conference on Applied Superconductivity and Electromagnetic Devices (ASEMD), 2015, : 54 - 55
  • [5] NoC-Based Fault-Tolerant Cache Design in Chip Multiprocessors
    Banaiyanmofrad, Abbas
    Girao, Gustavo
    Dutt, Nikil
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2014, 13
  • [6] Service Oriented Architecture For Load Balancing With Fault Tolerant In Grid Computing
    Indhumathi, V.
    Nasira, G. M.
    2016 IEEE INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTER APPLICATIONS (ICACA), 2016, : 313 - 317
  • [7] Fault-tolerant MIN Shuffle-Exchange NoC
    Mazaheri, Ali
    Sabbaghi-Nadooshan, Reza
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [8] A Fault-Tolerant NoC Scheme Using Bidirectional Channel
    Tsai, Wen-Chung
    Zheng, Deng-Yuan
    Chen, Sao-Jie
    Hu, Yu-Hen
    PROCEEDINGS OF THE 48TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2011, : 918 - 923
  • [9] Oddlab: fault-tolerant aware load-balancing framework for data center networks
    Aymen Hasan Alawadi
    Sándor Molnár
    Annals of Telecommunications, 2022, 77 : 641 - 662
  • [10] Low Overhead Monitor Mechanism for Fault-tolerant Analysis of NoC
    Liu, Junxiu
    Harkin, Jim
    Li, Yuhua
    Maguire, Liam
    Linares-Barranco, Alejandro
    2014 IEEE 8TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANYCORE SOCS (MCSOC), 2014, : 189 - 196