Concurrent kernel execution and interference analysis on GPUs using deep learning approaches

被引：0

作者：

Ayub, Mohammed ^{[1
]}

Helmy, Tarek ^{[1
,2
]}

机构：

[1] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Mail Box 413, Dhahran 31261, Saudi Arabia

[2] King Fahd Univ Petr & Minerals, Interdisciplinary Res Ctr Intelligent Secure Syst, Dhahran, Saudi Arabia

来源：

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES | 2022年 / 34卷 / 10期

关键词：

Concurrent kernel execution; Interference analysis in GPUs; Deep learning; Datasets; Performance evaluation;

D O I：

10.1016/j.jksuci.2022.10.016

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Graphic processing units (GPUs) are becoming gradually popular in large-scale data centers and cloud computing. Sharing a GPU across different applications is an important feature in these platforms to improve resource utilization and system productivity. However, in situations where GPUs are competitively shared, some challenges arise. In this paper, concurrent kernel execution and interference detection are investigated using Extreme Gradient Boosting, Convolutional Neural Network, Gated Recurrent Unit, and Conditional Generative Adversarial Network learning techniques on Tesla P100 and RTX-2080 GPU architectures. The experimental results obtained by applying the deep learning models on the dataset, which consists of four GPU resources: blocks per grid, threads per block, number of registers, and shared memory, show that the Extreme Gradient Boosting and Multi-Channel Convolutional Neural Network models have the promising capability of concurrent kernel execution classification and interference detection. In comparison to the existing work, the proposed models investigated using real and synthesized datasets show clear outperformance and generalization capability in terms of recall and precision.

引用

页码：10193 / 10204

页数：12

共 20 条

[1]

Adhinarayanan V, 2016, INT SYM PERFORM ANAL, P307, DOI 10.1109/ISPASS.2016.7482105

[2]

Aiizadelt N.S., 2020, 25 INT COMPUTER C CO, P1

[3] Using machine learning techniques to analyze the performance of concurrent kernel execution on GPUs [J].

Carvalho, Pablo ;

Clua, Esteban ;

Paes, Aline ;

Bentes, Cristiana ;

Lopes, Bruno ;

Drummond, Lucia Maria de A. .

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 113 (113) :528-540

[4]

Cassagne A., 2013, PROJET ETUDE RECHERC, P1

[5]

Eyrdud-Uubois I, 2020, 2020 IEEE 27 INT C H, P192

[6]

Farber R, 2011, CUDA APPLICATION DESIGN AND DEVELOPMENT, P1

[7] HGP4CNN: an efficient parallelization framework for training convolutional neural networks on modern GPUs [J].

Fu, Hao ;

Tang, Shanjiang ;

He, Bingsheng ;

Yu, Ce ;

Sun, Jizhou .

JOURNAL OF SUPERCOMPUTING, 2021, 77 (11) :12741-12770

[8] Characterizing concurrency mechanisms for NVIDIA GPUs under deep learning workloads [J].

Gilman, Guin ;

Walls, Robert J. .

PERFORMANCE EVALUATION, 2021, 151

[9] Generative Adversarial Networks [J].

Goodfellow, Ian ;

Pouget-Abadie, Jean ;

Mirza, Mehdi ;

Xu, Bing ;

Warde-Farley, David ;

Ozair, Sherjil ;

Courville, Aaron ;

Bengio, Yoshua .

COMMUNICATIONS OF THE ACM, 2020, 63 (11) :139-144

[10]

Jiao Q, 2015, IEEEACM INT S CODEGE, P1

← 1 2 →