Comparison of analytical and ML-based models for predicting CPU-GPU data transfer time

被引:6
|
作者
Riahi, Ali [1 ]
Savadi, Abdorreza [1 ]
Naghibzadeh, Mahmoud [1 ]
机构
[1] Ferdowsi Univ Mashhad, Comp Engn Dept, Mashhad, Razavi Khorasan, Iran
关键词
GPU; CUDA; Data transfer time prediction; Analytical and ML-based models;
D O I
10.1007/s00607-019-00780-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The overhead of data transfer to the GPU poses a bottleneck for the performance of CUDA programs. The accurate prediction of data transfer time is quite effective in improving the performance of GPU analytical modeling, the prediction accuracy of kernel performance, and the composition of the CPU with the GPU for solving computational problems. For estimating the data transfer time between the CPU and the GPU, the current study employs three machine learning-based models and a new analytical model called lambda-Model. These models run on four GPUs from different NVIDIA architectures and their performance is compared. The practical results show that the lambda-Model is able to anticipate the transmission of large-sized data with a maximum error of 1.643%, which offers better performance than that of machine learning methods. As for the transmission of small-sized data, machine learning-based methods provide better performance and a predicted data transfer time with a maximum error of 4.52%. Consequently, the current study recommends a hybrid model, that is, the lambda-Model for large-sized data and machine learning tools for small-sized data.
引用
收藏
页码:2099 / 2116
页数:18
相关论文
共 6 条
  • [1] Comparison of analytical and ML-based models for predicting CPU–GPU data transfer time
    Ali Riahi
    Abdorreza Savadi
    Mahmoud Naghibzadeh
    Computing, 2020, 102 : 2099 - 2116
  • [2] A CPU-GPU Data Transfer Optimization Approach Based On Code Migration and Merging
    Fu, Cong
    Zhai, Yanlong
    Wang, Zhenhua
    2017 16TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2017, : 23 - 26
  • [3] Performance Improvement of CUDA Applications by Reducing CPU-GPU Data Transfer Overhead
    Sunitha, N., V
    Raju, K.
    Chiplunkar, Niranjan N.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2017, : 211 - 215
  • [4] A Real-time SAR Imaging System Based on CPU-GPU Heterogeneous Platform
    Wu, Yewei
    Chen, Jun
    Zhang, Hongqun
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 461 - 464
  • [5] CPU-GPU Collaborated Computation Models for Biological Sequence Alignment with Mirror-based Work Load Balancing
    Soundarajan, Sanjay
    Salomon, Michelle
    Park, Jin H.
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 240 - 248
  • [6] Multi-core-CPU and GPU-accelerated radiative transfer models based on the discrete ordinate method
    Efremenko, Dmitry S.
    Loyola, Diego G.
    Doicu, Adrian
    Spurr, Robert J. D.
    COMPUTER PHYSICS COMMUNICATIONS, 2014, 185 (12) : 3079 - 3089