LD: Low-Overhead GPU Race Detection Without Access Monitoring

被引:12
|
作者
Li, Pengcheng [1 ]
Hu, Xiaoyu [1 ]
Chen, Dong [1 ]
Brock, Jacob [1 ]
Luo, Hao [1 ]
Zhang, Eddy Z. [2 ]
Ding, Chen [1 ]
机构
[1] Univ Rochester, POB 270226,CSB Bldg, Rochester, NY 14627 USA
[2] Rutgers State Univ, Dept Comp Sci, 110 Frelinghuysen Rd, Piscataway, NJ 08854 USA
基金
美国国家科学基金会;
关键词
GPU race detection; low overhead; value-based checking; instrumentation-free;
D O I
10.1145/3046678
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Data race detection has become an important problem in GPU programming. Previous designs of CPU racechecking tools are mainly task parallel and incur high overhead on GPUs due to access instrumentation, especially when monitoring many thousands of threads routinely used by GPU programs. This article presents a novel data-parallel solution designed and optimized for the GPU architecture. It includes compiler support and a set of runtime techniques. It uses value-based checking, which detects the races reported in previous work, finds new races, and supports race-free deterministic GPU execution. More important, race checking is massively data parallel and does not introduce divergent branching or atomic synchronization. Its slowdown is less than 5x for over half of the tests and 10x on average, which is orders of magnitude more efficient than the cuda-memcheck tool by Nvidia and the methods that use fine-grained access instrumentation.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] LAK: A Low-Overhead Lock-and-Key Based Schema for GPU Memory Safety
    Zhang, Chaochao
    Hou, Rui
    Proceedings - IEEE International Conference on Computer Design: VLSI in Computers and Processors, 2022, 2022-October : 705 - 713
  • [22] LAK: A Low-Overhead Lock-and-Key Based Schema for GPU Memory Safety
    Zhang, Chaochao
    Hou, Rui
    2022 IEEE 40TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2022), 2022, : 705 - 713
  • [23] STOCK: Stochastic Checkers for Low-overhead Approximate Error Detection
    Gala, Neel
    Venkataramani, Swagath
    Raghunathan, Anand
    Kamakoti, V
    ISLPED '16: PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2016, : 266 - 271
  • [24] Design and Evaluation of GPU-FPX: A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs
    Li, Xinyi
    Laguna, Ignacio
    Fang, Bo
    Swirydowicz, Katarzyna
    Li, Ang
    Gopalakrishnan, Ganesh
    PROCEEDINGS OF THE 32ND INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, HPDC 2023, 2023, : 59 - 71
  • [25] A Low-overhead Kernel Object Monitoring Approach for Virtual Machine Introspection
    Zhan, Dongyang
    Li, Huhua
    Ye, Lin
    Zhang, Hongli
    Fang, Binxing
    Du, Xiaojiang
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [26] Increasing the locality of memory access patterns by low-overhead hardware address relocation
    Macii, A
    Macii, E
    Poncino, M
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL V: BIO-MEDICAL CIRCUITS & SYSTEMS, VLSI SYSTEMS & APPLICATIONS, NEURAL NETWORKS & SYSTEMS, 2003, : 385 - 388
  • [27] Low-overhead run-time memory leak detection and recovery
    Tsai, Timothy
    Vaidyanathan, Kalyan
    Gross, Kenny
    12TH PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2006, : 329 - 337
  • [28] Sampling plus DMR: Practical and Low-overhead Permanent Fault Detection
    Nomura, Shuou
    Sinclair, Matthew D.
    Ho, Chen-Han
    Govindaraju, Venkatraman
    de Kruijf, Marc
    Sankaralingam, Karthikeyan
    ISCA 2011: PROCEEDINGS OF THE 38TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, 2011, : 201 - 212
  • [29] Low-overhead memory leak detection using adaptive statistical profiling
    Chilimbi, TM
    Hauswirth, M
    ACM SIGPLAN NOTICES, 2004, 39 (11) : 156 - 164
  • [30] Automatic Low-Overhead Load-Imbalance Detection in MPI Applications
    Arzt, Peter
    Fischler, Yannic
    Lehr, Jan-Patrick
    Bischof, Christian
    EURO-PAR 2021: PARALLEL PROCESSING, 2021, 12820 : 19 - 34