GPU Architecture Aware Instruction Scheduling for Improving Soft-Error Reliability

被引:5
|
作者
Lee H. [1 ]
Al Faruque M.A. [1 ]
机构
[1] Department of Electrical Engineering and Computer Science, University of California, Irvine, 92697, CA
关键词
compiler; GPGPU; instruction scheduling; reliability; soft-error;
D O I
10.1109/TMSCS.2017.2667661
中图分类号
学科分类号
摘要
The demand for low-power and high-performance computing has been driving the semiconductor industry for decades. The semiconductor technology has been scaled down to satisfy these demands. At the same time, the semiconductor technology has faced severe reliability challenges like soft-error. Research has been conducted to improve the soft-error reliability of the GPU, which has been improved by using various methodologies such as redundancy methodologies. However, the GPU compiler has yet to be considered for improving the soft-error reliability of the GPU. In this paper, in order to improve the soft-error reliability of the GPU, we propose a novel GPU architecture aware compilation methodology. The proposed methodology jointly considers the parallel behavior of the GPU hardware and the applications, and minimizes the vulnerability of the GPU applications during instruction scheduling. In addition, the proposed methodology is able to complement any hardware based soft-error reliability improvement techniques. We compared our compilation methodology with the state-of-the-art soft-error reliability aware techniques and the performance aware instruction scheduling. We have injected the soft-errors during the experiments and have compared the number of correct executions that have no erroneous output. Our methodology requires less performance and power overhead than the state-of-the-art soft-error reliability methodologies in most cases. Compilation time overhead of our methodology is 8.13 seconds on average. The experimental results show that our methodology improves the soft-error reliability by 23 percent and 12 percent (up to 64 percent and 52 percent) compared to the state-of-the-art soft-error reliability and performance aware compilation techniques, respectively. Moreover, we have shown that the soft-error reliability of a GPU is not related to the performance, but to the fine-grained timing behavior of an application. © 2015 IEEE.
引用
收藏
页码:86 / 99
页数:13
相关论文
共 50 条
  • [21] Latency-Aware Selection of Check Variables for Soft-Error Tolerant Datapath Synthesis
    Oh, Junghoon
    Kaneko, Mineo
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (07) : 1506 - 1510
  • [22] Soft Error Reliability Aware Placement and Routing for FPGAs
    Abdul-Aziz, Mohammed A.
    Tahoori, Mehdi B.
    INTERNATIONAL TEST CONFERENCE 2010, 2010,
  • [23] HW/SW architecture for soft-error cancellation in real-time operating system
    Neishaburi, M. H.
    Kakoee, M. R.
    Daneshtalab, M.
    Safari, S.
    IEICE ELECTRONICS EXPRESS, 2007, 4 (23): : 755 - 761
  • [24] REMORA: A Hybrid Low-Cost Soft-Error Reliable Fault Tolerant Architecture
    Gopalakrishnan, Shoba
    Singh, Virendra
    2017 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), 2017, : 125 - 130
  • [25] IMPROVING AUTONOMOUS SOFT-ERROR TOLERANCE OF FPGA THROUGH LUT CONFIGURATION BIT MANIPULATION
    Das, Anup
    Venkataraman, Shyamsundar
    Kumar, Akash
    2013 23RD INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL 2013) PROCEEDINGS, 2013,
  • [26] Improving Code Density with Variable Length Encoding Aware Instruction Scheduling
    Kultala, Heikki
    Viitanen, Timo
    Jaaskelainen, Pekka
    Helkala, Janne
    Takala, Jarmo
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 84 (03): : 435 - 446
  • [27] Improving Code Density with Variable Length Encoding Aware Instruction Scheduling
    Heikki Kultala
    Timo Viitanen
    Pekka Jääskeläinen
    Janne Helkala
    Jarmo Takala
    Journal of Signal Processing Systems, 2016, 84 : 435 - 446
  • [28] LEAP: Layout Design through Error-Aware Transistor Positioning for Soft-Error Resilient Sequential Cell Design
    Lee, Hsiao-Heng Kelin
    Lilja, Klas
    Bounasser, Mounaim
    Relangi, Prasanthi
    Linscott, Ivan R.
    Inan, Umran S.
    Mitra, Subhasish
    2010 INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM, 2010, : 203 - 212
  • [29] GPU Age-Aware Scheduling to Improve the Reliability of Leadership Jobs on Titan
    Zimmer, Christopher
    Maxwell, Don
    McNally, Stephen
    Atchley, Scott
    Vazhkudai, Sudharshan S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE, AND ANALYSIS (SC'18), 2018,
  • [30] Cost-Efficient Scheduling in High-Level Synthesis for Soft-Error Vulnerability Mitigation
    Hara-Azumi, Yuko
    Tomiyama, Hiroyuki
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2013), 2013, : 502 - 507