Comparison of analytical and ML-based models for predicting CPU-GPU data transfer time

被引：6

作者：

Riahi, Ali ^{[1
]}

Savadi, Abdorreza ^{[1
]}

Naghibzadeh, Mahmoud ^{[1
]}

机构：

[1] Ferdowsi Univ Mashhad, Comp Engn Dept, Mashhad, Razavi Khorasan, Iran

来源：

COMPUTING | 2020年 / 102卷 / 09期

关键词：

GPU; CUDA; Data transfer time prediction; Analytical and ML-based models;

D O I：

10.1007/s00607-019-00780-x

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The overhead of data transfer to the GPU poses a bottleneck for the performance of CUDA programs. The accurate prediction of data transfer time is quite effective in improving the performance of GPU analytical modeling, the prediction accuracy of kernel performance, and the composition of the CPU with the GPU for solving computational problems. For estimating the data transfer time between the CPU and the GPU, the current study employs three machine learning-based models and a new analytical model called lambda-Model. These models run on four GPUs from different NVIDIA architectures and their performance is compared. The practical results show that the lambda-Model is able to anticipate the transmission of large-sized data with a maximum error of 1.643%, which offers better performance than that of machine learning methods. As for the transmission of small-sized data, machine learning-based methods provide better performance and a predicted data transfer time with a maximum error of 4.52%. Consequently, the current study recommends a hybrid model, that is, the lambda-Model for large-sized data and machine learning tools for small-sized data.

引用

页码：2099 / 2116

页数：18

共 6 条

[1] Comparison of analytical and ML-based models for predicting CPU–GPU data transfer time
Ali Riahi
Abdorreza Savadi
Mahmoud Naghibzadeh
Computing, 2020, 102 : 2099 - 2116
[2] A CPU-GPU Data Transfer Optimization Approach Based On Code Migration and Merging
Fu, Cong
Zhai, Yanlong
Wang, Zhenhua
2017 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2017, : 23 - 26
[3] Performance Improvement of CUDA Applications by Reducing CPU-GPU Data Transfer Overhead
Sunitha, N., V
Raju, K.
Chiplunkar, Niranjan N.
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 211 - 215
[4] A Real-time SAR Imaging System Based on CPU-GPU Heterogeneous Platform
Wu, Yewei
Chen, Jun
Zhang, Hongqun
PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 461 - 464
[5] CPU-GPU Collaborated Computation Models for Biological Sequence Alignment with Mirror-based Work Load Balancing
Soundarajan, Sanjay
Salomon, Michelle
Park, Jin H.
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 240 - 248
[6] Multi-core-CPU and GPU-accelerated radiative transfer models based on the discrete ordinate method
Efremenko, Dmitry S.
Loyola, Diego G.
Doicu, Adrian
Spurr, Robert J. D.
COMPUTER PHYSICS COMMUNICATIONS, 2014, 185 (12) : 3079 - 3089

← 1 →