Removing Host Interventions from GPU Accelerated Neural Network

被引:0
|
作者
Yoo, Jeongjoon [1 ]
Oh, Kyongjoo [1 ]
Jun, Jaehee [1 ]
Cho, Hoonhee [1 ]
Kim, Kyeongmin [1 ]
机构
[1] Samsung Elect, Suwon, South Korea
来源
2023 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, ICCE | 2023年
关键词
OpenCL; GPU; Neural Network; Performance; Delegator; Host intervention;
D O I
10.1109/ICCE56470.2023.10043523
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we present a novel scheme removing host interventions from OpenCL execution in GPUaccelerated neural network. Our scheme must satisfy two folds; i) remove host interventions between CPU and GPU interactions, ii) use previous OpenCL kernels without modification. To do so, we propose a Delegator-based OpenCL execution method providing a good solution in such situation. Experimental result shows that our scheme reduces execution latency into 0.43.
引用
收藏
页数:2
相关论文
共 50 条
  • [31] Novel accelerated methods for convolution neural network with matrix core
    Yijie Guo
    Lu Lu
    Songxiang Zhu
    The Journal of Supercomputing, 2023, 79 : 19547 - 19573
  • [32] Accelerated Parallelizable Neural Network Learning Algorithm for Speech Recognition
    Yu, Dong
    Deng, Li
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2292 - 2295
  • [33] Accelerated gradient learning algorithm for neural network weights update
    Hocenski, Zeljko
    Antunoviae, Mladen
    Filko, Damir
    NEURAL COMPUTING & APPLICATIONS, 2010, 19 (02) : 219 - 225
  • [34] Neuromorphic Neural Network Parallelization on CUDA Compatible GPU for EEG Signal Classification
    Bako, Laszlo
    Kolcsar, Arpad-Zoltan
    Brassal, Sandor-Tihamer
    Marton, Laszlo-Ferenc
    Losonczi, Lajos
    2012 SIXTH UKSIM/AMSS EUROPEAN SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS), 2012, : 359 - 364
  • [35] Research on Stock Forecasting Based on GPU and Complex-Valued Neural Network
    Jia, Lina
    Yang, Bin
    Zhang, Wei
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, PT II, 2018, 10955 : 120 - 128
  • [36] Large-scale neural circuit mapping data analysis accelerated with the graphical processing unit (GPU)
    Shi, Yulin
    Veidenbaum, Alexander V.
    Nicolau, Alex
    Xu, Xiangmin
    JOURNAL OF NEUROSCIENCE METHODS, 2015, 239 : 1 - 10
  • [37] GPU ACCELERATED VIEW SYNTHESIS FROM MULTIPLE RGB-D IMAGES
    Park, Anjin
    Kim, Jinwook
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 573 - 576
  • [38] A Parallel Probabilistic Neural Network ECG Recognition Architecture over GPU Platforms
    Phaudphut, Comdet
    So-In, Chakchai
    Phusomsai, Warintorn
    2016 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2016, : 30 - 36
  • [39] GWVT: A GPU Maritime Vessel Tracker based on the Wisard Weightless Neural Network
    Moreira, Rodrigo da Silva
    Favilla Ebecken Affiliation, Nelson Francisco
    2017 COMPUTING CONFERENCE, 2017, : 738 - 743
  • [40] GPU Accelerated FDTD Solver Algorithm for Radiation from MMIC Passive Components
    Morita, Nagayoshi
    2013 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSC), 2013, : 92 - 97