An On-Line Testing Technique for the Scheduler Memory of a GPGPU

被引:8
|
作者
Carlo, Stefano Di [1 ]
Condia, Josie E. Rodriguez [1 ]
Reorda, Matteo Sonza [1 ]
机构
[1] Politecn Torino, Dept Control & Comp Engn DAUIN, I-10129 Turin, Italy
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Functional test; general purpose graphics processing units (GPGPUs); memory test; software-based self-test (SBST); SELF-TEST; GENERATION;
D O I
10.1109/ACCESS.2020.2968139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The highly parallel processing capabilities and reduced power performance of General Purpose Graphics Processing Units (GPGPUs) have been crucial factors for their massive use in multiple fields, such as multimedia and high-performance computing applications. Nowadays, more demanding areas, such as automotive, employ GPGPU devices where safety and reliability are mandatory design constraints. Nevertheless, the structural complexity, the transistor density, and the implementation in the latest silicon technologies introduce challenges to match safety and reliability requirements. In these technologies, wear-out and aging are factors that may significantly increase the occurrence of permanent faults during the lifetime operation. Moreover, these faults may generate unacceptable misbehaviors during the execution of an application. These constraints require devising new methods for in-field fault detection, thus verifying the integrity and correct behavior of the device during its whole operational life. This work proposes a technique to generate functional self-test programs targeting the detection of permanent static faults in the memory of the warp scheduler of a GPGPU. The proposed technique can translate fault primitives, which represent the effect of faults in a memory cell, into self-test functions and programs composed of a sequence of operations to excite the fault in the memory and to propagate its effects to a visible location, thus detecting its presence. We focused on the memory in the warp scheduler because it represents a crucial module for the device operation. Furthermore, this memory is present in each Streaming Multiprocessor (SM) of a GPGPU. Some experimental results to validate the method have been gathered, resorting to the NVIDIA Visual Profiler and the Nsight Debugger using the NVIDIA-GEFORCE GTX GPU platform and a structural fault simulator. The CUDA programming environment was used to implement the test procedures.
引用
收藏
页码:16893 / 16912
页数:20
相关论文
共 50 条
  • [21] On-line testing in fuzzy controllers
    Kamiura, Naotake
    Hata, Yutaka
    Systems and Computers in Japan, 2001, 32 (08) : 63 - 71
  • [22] A Passive Testing Technique with Minimized On-line Processing For Fault Management of Network Protocols
    Kim, Tae-Hyong
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2007, 7 (03): : 7 - 14
  • [23] A technique for on-line data migration
    Wang, JH
    Miyazaki, M
    Li, J
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (01) : 113 - 120
  • [24] AN ON-LINE TECHNIQUE FOR SYSTEM IDENTIFICATION
    HSIA, TC
    VIMOLVANICH, V
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1969, AC14 (01) : 92 - +
  • [25] On-line hemodiafiltration: Technique and therapy
    Ledebo, I
    ADVANCES IN RENAL REPLACEMENT THERAPY, 1999, 6 (02): : 195 - 208
  • [26] On-line hemodiafiltration: Technique and efficiency
    Pedrini, LA
    JOURNAL OF NEPHROLOGY, 2003, 16 : S57 - S63
  • [27] An On-Line Scheduler over Hard Real-Time Communication System
    CHEN Hui XIONG Guangze(School of Computer Science and Engineering
    Journal of Electronic Science and Technology of China, 2003, (01) : 47 - 53
  • [28] Field practices of UHF technique for on-line PD monitoring and site testing of power transformers
    Tang, ZG
    Li, CR
    Huang, XQ
    Wang, W
    Ding, YS
    2004 IEEE PES POWER SYSTEMS CONFERENCE & EXPOSITION, VOLS 1 - 3, 2004, : 1176 - 1182
  • [29] UNIFIED TECHNIQUE FOR ON-LINE TESTING OF DIGITAL CIRCUITS: DELAY AND STUCK-AT FAULT MODELS
    Biswas, S.
    Mukhopadhyay, S.
    Patra, A.
    Sarkar, D.
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2008, 17 (06) : 1069 - 1089
  • [30] A technique to predict gate oxide reliability using fast on-line ramped QBD testing
    Mullen, E
    Leveugle, C
    Molyneaux, J
    Prendergast, J
    Suehle, JS
    40TH ANNUAL PROCEEDINGS: INTERNATIONAL RELIABILITY PHYSICS SYMPOSIUM, 2002, : 292 - 297