Modeling and Analysis of Fault-tolerant Distributed Memories for Networks-on-Chip

被引:0
作者
BanaiyanMofrad, Abbas [1 ]
Dutt, Nikil [1 ]
Girao, Gustavo [2 ]
机构
[1] Univ Calif Irvine, Ctr Embedded Comp Syst, Irvine, CA 92697 USA
[2] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
来源
DESIGN, AUTOMATION & TEST IN EUROPE | 2013年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advances in technology scaling increasingly make Network-on-Chips (NoCs) more susceptible to failures that cause various reliability challenges. With increasing area occupied by different on-chip memories, strategies for maintaining fault-tolerance of distributed on-chip memories become a major design challenge. We propose a system-level design methodology for scalable fault-tolerance of distributed on-chip memories in NoCs. We introduce a novel reliability clustering model for fault-tolerance analysis and shared redundancy management of onchip memory blocks. We perform extensive design space exploration applying the proposed reliability clustering on a block-redundancy fault-tolerant scheme to evaluate the tradeoffs between reliability, performance, and overheads. Evaluations on a 64-core chip multiprocessor (CMP) with an 8x8 mesh NoC show that distinct strategies of our case study may yield up to 20% improvements in performance gains and 25% improvement in energy savings across different benchmarks, and uncover interesting design configurations.
引用
收藏
页码:1605 / 1608
页数:4
相关论文
共 50 条
  • [31] A Runtime Fault-Tolerant Routing Scheme for Partially Connected 3D Networks-on-Chip
    Coelho, Alexandre
    Charif, Amir
    Zergainoh, Nacer-Eddine
    Velazco, Raoul
    [J]. 2018 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), 2018,
  • [32] Fault-Tolerant Networks-on-Chip Routing With Coarse and Fine-Grained Look-Ahead
    Liu, Junxiu
    Harkin, Jim
    Li, Yuhua
    Maguire, Liam P.
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2016, 35 (02) : 260 - 273
  • [33] Fault-tolerant Communication in Invasive Networks on Chip
    Heisswolf, Jan
    Weichslgartner, Andreas
    Zaib, Aurang
    Friederich, Stephanie
    Masing, Leonard
    Stein, Carsten
    Duden, Marco
    Kloepfer, Roman
    Teich, Juergen
    Wild, Thomas
    Herkersdorf, Andreas
    Becker, Juergen
    [J]. 2015 NASA/ESA CONFERENCE ON ADAPTIVE HARDWARE AND SYSTEMS (AHS), 2015,
  • [34] Reconfigurable fault tolerant routing for networks-on-chip with logical hierarchy
    Schley, Gert
    Ahmed, Ibrahim
    Afzal, Muhammad
    Radetzki, Martin
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2016, 51 : 195 - 206
  • [35] Mapping a Fault-Tolerant Distributed Algorithm to Systems on Chip
    Fuchs, Gottfried
    Fuegger, Matthias
    Schmid, Ulrich
    Steininger, Andreas
    [J]. 11TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN - ARCHITECTURES, METHODS AND TOOLS : DSD 2008, PROCEEDINGS, 2008, : 242 - 249
  • [36] A New Scalable Fault Tolerant Routing Algorithm for Networks-on-Chip
    Kia, Hamed Sajjadi
    Ababei, Cristinel
    Srinivasan, Sudarshan
    Jabeen, Shaista
    [J]. 2015 IEEE 58TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2015,
  • [37] Minimal-Path Fault-Tolerant Approach Using Connection-Retaining Structure in Networks-on-Chip
    Ebrahimi, Masoumeh
    Daneshtalab, Masoud
    Plosila, Juha
    Tenhunen, Hannu
    [J]. 2013 SEVENTH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS 2013), 2013,
  • [38] DISTRIBUTED RECOVERY IN FAULT-TOLERANT MULTIPROCESSOR NETWORKS
    YANNEY, RM
    HAYES, JP
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1986, 35 (10) : 871 - 879
  • [39] ON RELIABILITY MODELING OF FAULT-TOLERANT DISTRIBUTED SYSTEMS
    THAMBIDURAI, P
    PARK, YK
    TRIVEDI, KS
    [J]. 9TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 1989, : 136 - 142
  • [40] ON THE FAULT-TOLERANT ROUTING IN DISTRIBUTED LOOP NETWORKS
    Liu Huanping Yang Yixian (Po Box 126
    [J]. JournalofElectronics(China), 2000, (01) : 84 - 89