Improving chip multiprocessor reliability through code replication

被引:1
|
作者
Ozturk, Ozcan [1 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
关键词
Reliability; Code replication; Energy consumption; Chip multiprocessors; Compilers; CONCURRENT ERROR-DETECTION;
D O I
10.1016/j.compeleceng.2009.11.004
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Chip multiprocessors (CMPs) are promising candidates for the next generation computing platforms to utilize large numbers of gates and reduce the effects of high interconnect delays. One of the key challenges in CMP design is to balance out the often-conflicting demands. Specifically, for today's image/video applications and systems, power consumption, memory space occupancy, area cost, and reliability are as important as performance. Therefore, a compilation framework for CMPs should consider multiple factors during the optimization process. Motivated by this observation, this paper addresses the energy-aware reliability support for the CMP architectures, targeting in particular at array-intensive image/video applications. There are two main goals behind our compiler approach. First, we want to minimize the energy wasted in executing replicas when there is no error during execution (which should be the most frequent case in practice). Second, we want to minimize the time to recover (through the replicas) from an error when it occurs. This approach has been implemented and tested using four parallel array-based applications from the image/video processing domain. Our experimental evaluation indicates that the proposed approach saves significant energy over the case when all the replicas are run under the highest voltage/frequency level, without sacrificing any reliability over the latter. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:480 / 490
页数:11
相关论文
共 50 条
  • [1] Energy-aware code replication for improving reliability in embedded chip multiprocessors
    Chen, Guilin
    Ozturk, Ozcan
    Chen, Guangyu
    Kandemir, Mahmut
    IEEE INTERNATIONAL SOC CONFERENCE, PROCEEDINGS, 2006, : 77 - +
  • [2] On improving the Reliability of Internet Services through Active Replication
    Ayari, Narjess
    Barbaron, Denis
    Lefevre, Laurent
    PDCAT 2008: NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES, PROCEEDINGS, 2008, : 259 - +
  • [3] Thread Criticality Assisted Replication and Migration for Chip Multiprocessor Caches
    Li, Jianhua
    Li, Minming
    Xue, Chun Jason
    Ouyang, Yiming
    Shen, Fanfan
    IEEE TRANSACTIONS ON COMPUTERS, 2017, 66 (10) : 1747 - 1762
  • [4] Improving TLB performance on current chip multiprocessor architectures through demand-driven superpaging
    Qasem, Apan
    Magee, Josh
    SOFTWARE-PRACTICE & EXPERIENCE, 2013, 43 (06): : 705 - 729
  • [5] Improving Yield and Reliability of Chip Multiprocessors
    Pan, Abhisek
    Khan, Omer
    Kundu, Sandip
    DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 490 - 495
  • [6] Post-Silicon Code Coverage for Multiprocessor System-on-Chip Designs
    Balston, Kyle
    Karimibiuki, Mehdi
    Hu, Alan J.
    Ivanov, Andre
    Wilton, Steven J. E.
    IEEE TRANSACTIONS ON COMPUTERS, 2013, 62 (02) : 242 - 246
  • [7] Analyzing and Improving Performance Scalability of Commercial Server Workloads on a Chip Multiprocessor
    Ishizaki, Kazuaki
    Nakatani, Toshio
    Daijavad, Shahrokh
    PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL SYMPOSIUM ON WORKLOAD CHARACTERIZATION, 2009, : 217 - 226
  • [8] Chip Multiprocessor Design Space Exploration through Statistical Simulation
    Genbrugge, Davy
    Eeckhout, Lieven
    IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (12) : 1668 - 1681
  • [9] Reliability-Aware Heterogeneous 3D Chip Multiprocessor Design
    Ismail Akturk
    Ozcan Ozturk
    Journal of Electronic Testing, 2013, 29 : 177 - 184
  • [10] Unified System Level Reliability Evaluation Methodology for Multiprocessor Systems-on-Chip
    Yamamoto, Alexandre Yasuo
    Ababei, Cristinel
    2012 INTERNATIONAL GREEN COMPUTING CONFERENCE (IGCC), 2012,