Removing Host Interventions from GPU Accelerated Neural Network

被引：0

作者：

Yoo, Jeongjoon ^{[1
]}

Oh, Kyongjoo ^{[1
]}

Jun, Jaehee ^{[1
]}

Cho, Hoonhee ^{[1
]}

Kim, Kyeongmin ^{[1
]}

机构：

[1] Samsung Elect, Suwon, South Korea

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, ICCE | 2023年

关键词：

OpenCL; GPU; Neural Network; Performance; Delegator; Host intervention;

D O I：

10.1109/ICCE56470.2023.10043523

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

In this paper, we present a novel scheme removing host interventions from OpenCL execution in GPUaccelerated neural network. Our scheme must satisfy two folds; i) remove host interventions between CPU and GPU interactions, ii) use previous OpenCL kernels without modification. To do so, we propose a Delegator-based OpenCL execution method providing a good solution in such situation. Experimental result shows that our scheme reduces execution latency into 0.43.

引用

页数：2

共 50 条

[41] Visualization and GPU-accelerated simulation of medical ultrasound from CT images
Kutter, Oliver
Shams, Ramtin
Navab, Nassir
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2009, 94 (03) : 250 - 266
[42] GPU accelerated segmentation and centerline extraction of tubular structures from medical images
Erik Smistad
Anne C. Elster
Frank Lindseth
International Journal of Computer Assisted Radiology and Surgery, 2014, 9 : 561 - 575
[43] GPU accelerated segmentation and centerline extraction of tubular structures from medical images
Smistad, Erik
Elster, Anne C.
Lindseth, Frank
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2014, 9 (04) : 561 - 575
[44] A GPU-Based Training of BP Neural Network for Healthcare Data Analysis
Song, Wei
Zou, Shuanghui
Tian, Yifei
Fong, Simon
ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, MUE/FUTURETECH 2018, 2019, 518 : 193 - 198
[45] Optimization and Analysis of Parallel Back Propagation Neural Network on GPU Using CUDA
Wang, Yaobin
Tang, Pingping
An, Hong
Liu, Zhiqin
Wang, Kun
Zhou, Yong
NEURAL INFORMATION PROCESSING, PT III, 2015, 9491 : 156 - 163
[46] SNICIT: Accelerating Sparse Neural Network Inference via Compression at Inference Time on GPU
Jiang, Shui
Huang, Tsung-Wei
Yu, Bei
Ho, Tsung-Yi
PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 51 - 61
[47] GPU Implementation of Neural-Network Simulations based on Adaptive-Exponential Models
Neofytou, Alexandros
Chatzikostantis, George
Magkanaris, Ioannis
Smaragdos, George
Strydis, Christos
Soudris, Dimitrios
2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 339 - 343
[48] Efficient GPU-Accelerated Extraction of Imperfect Inverted Repeats from DNA Sequences
Baskett, William
Spencer, Matthew
Shyu, Chi-Ren
2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 516 - 520
[49] G-NetMon: A GPU-accelerated Network Performance Monitoring System for Large Scale Scientific Collaborations
Wu, Wenji
DeMar, Phil
Holmgren, Don
Singh, Amitoj
Pordes, Ruth
2011 IEEE 36TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2011, : 195 - 198
[50] Efficient approximation of neural filters for removing quantum noise from images
Suzuki, K
Horiba, I
Sugie, N
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (07) : 1787 - 1799

← 1 2 3 4 5 →