Removing Host Interventions from GPU Accelerated Neural Network

被引:0
|
作者
Yoo, Jeongjoon [1 ]
Oh, Kyongjoo [1 ]
Jun, Jaehee [1 ]
Cho, Hoonhee [1 ]
Kim, Kyeongmin [1 ]
机构
[1] Samsung Elect, Suwon, South Korea
来源
2023 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, ICCE | 2023年
关键词
OpenCL; GPU; Neural Network; Performance; Delegator; Host intervention;
D O I
10.1109/ICCE56470.2023.10043523
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we present a novel scheme removing host interventions from OpenCL execution in GPUaccelerated neural network. Our scheme must satisfy two folds; i) remove host interventions between CPU and GPU interactions, ii) use previous OpenCL kernels without modification. To do so, we propose a Delegator-based OpenCL execution method providing a good solution in such situation. Experimental result shows that our scheme reduces execution latency into 0.43.
引用
收藏
页数:2
相关论文
共 50 条
  • [41] Visualization and GPU-accelerated simulation of medical ultrasound from CT images
    Kutter, Oliver
    Shams, Ramtin
    Navab, Nassir
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2009, 94 (03) : 250 - 266
  • [42] GPU accelerated segmentation and centerline extraction of tubular structures from medical images
    Erik Smistad
    Anne C. Elster
    Frank Lindseth
    International Journal of Computer Assisted Radiology and Surgery, 2014, 9 : 561 - 575
  • [43] GPU accelerated segmentation and centerline extraction of tubular structures from medical images
    Smistad, Erik
    Elster, Anne C.
    Lindseth, Frank
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2014, 9 (04) : 561 - 575
  • [44] A GPU-Based Training of BP Neural Network for Healthcare Data Analysis
    Song, Wei
    Zou, Shuanghui
    Tian, Yifei
    Fong, Simon
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 193 - 198
  • [45] Optimization and Analysis of Parallel Back Propagation Neural Network on GPU Using CUDA
    Wang, Yaobin
    Tang, Pingping
    An, Hong
    Liu, Zhiqin
    Wang, Kun
    Zhou, Yong
    NEURAL INFORMATION PROCESSING, PT III, 2015, 9491 : 156 - 163
  • [46] SNICIT: Accelerating Sparse Neural Network Inference via Compression at Inference Time on GPU
    Jiang, Shui
    Huang, Tsung-Wei
    Yu, Bei
    Ho, Tsung-Yi
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 51 - 61
  • [47] GPU Implementation of Neural-Network Simulations based on Adaptive-Exponential Models
    Neofytou, Alexandros
    Chatzikostantis, George
    Magkanaris, Ioannis
    Smaragdos, George
    Strydis, Christos
    Soudris, Dimitrios
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 339 - 343
  • [48] Efficient GPU-Accelerated Extraction of Imperfect Inverted Repeats from DNA Sequences
    Baskett, William
    Spencer, Matthew
    Shyu, Chi-Ren
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 516 - 520
  • [49] G-NetMon: A GPU-accelerated Network Performance Monitoring System for Large Scale Scientific Collaborations
    Wu, Wenji
    DeMar, Phil
    Holmgren, Don
    Singh, Amitoj
    Pordes, Ruth
    2011 IEEE 36TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2011, : 195 - 198
  • [50] Efficient approximation of neural filters for removing quantum noise from images
    Suzuki, K
    Horiba, I
    Sugie, N
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (07) : 1787 - 1799