Router-shared-pair mesh: a reconfigurable fault-tolerant network-on-chip architecture

被引:3
作者
Chen, Yali [1 ,2 ,3 ]
Ren, Kaixin [1 ,2 ,3 ]
Gu, Naijie [1 ,2 ,3 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei, Anhui, Peoples R China
[2] Anhui Prov Key Lab Comp & Commun Software, Hefei, Anhui, Peoples R China
[3] Univ Sci & Technol China, Inst Adv Technol, Hefei, Anhui, Peoples R China
关键词
network-on-chip; fault-tolerant; isolated PE problem; parted regions problem; topology reconfiguration; 2D mesh; router-shared-pair mesh;
D O I
10.1504/IJES.2018.095756
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a fault tolerant scheme on Network-on-Chip-based System-onChip (NoC-based SoC), for problems of Isolated Processing Element (PE) and Parted Regions caused by permanent faults. The scheme is referred to as Router-Shared-Pair mesh (RSPmesh). The topology architecture of the RSPmesh uses the design that a pair of neighbouring PEs share a pair of routers, and uses MUXs to provide diversity for link-connections between routers. Topology reconfiguration algorithm and routing algorithm corresponding to the RSPmesh are also proposed. Thus, when there are faulty routers or links, RSPmesh-based NoC can be reconfigured to a new 2D mesh NoC with maybe smaller size, but regular and with no faults, and it is able to serve all healthy PEs. The RSPmesh uses no spare routers, and only makes several routers disable according to actual needs in topology reconfiguration. Evaluation and experimental results show that the proposed scheme achieves significant improvements on reliability.
引用
收藏
页码:526 / 536
页数:11
相关论文
共 24 条
  • [1] [Anonymous], [No title captured]
  • [2] A survey of research and practices of network-on-chip
    Bjerregaard, Tobias
    Mahadevan, Shankar
    [J]. ACM COMPUTING SURVEYS, 2006, 38 (01) : 1 - 51
  • [3] A high-fault-coverage approach for the test of data, control, and handshake interconnects in mesh networks-on-chip
    Cota, Erika
    Kastensmidt, Fernanda Lima
    Cassel, Maico
    Herve, Marcos
    Almeida, Pedro
    Meirelles, Paulo
    Amory, Alexandre
    Lubaszewski, Marcelo
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2008, 57 (09) : 1202 - 1215
  • [4] Dally WJ, 2001, DES AUT CON, P684, DOI 10.1109/DAC.2001.935594
  • [5] Fault-tolerant adaptive routing under an unconstrained set of node and link failures for many-core systems-on-chip
    Dimopoulos, Michael
    Gang, Yi
    Anghel, Lorena
    Benabdenbi, Mounir
    Zergainoh, Nacer-Eddine
    Nicolaidis, Michael
    [J]. MICROPROCESSORS AND MICROSYSTEMS, 2014, 38 (06) : 620 - 635
  • [6] Dumitras T, 2003, ASP-DAC 2003: PROCEEDINGS OF THE ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, P225, DOI 10.1109/ASPDAC.2003.1195021
  • [7] Furber S, 2006, PROC EUR TEST SYMP, P4
  • [8] Jie Wu, 2004, International Journal of High Performance Computing and Networking, V1, P140
  • [9] Kosina H, 2006, INT J COMPUT SCI ENG, V2, P100, DOI 10.1504/IJCSE.2006.012762
  • [10] Kumar S, 2002, IEEE COMP SOC ANN, P117, DOI 10.1109/ISVLSI.2002.1016885