Benchmarking Optimization Algorithms for Auto-Tuning GPU Kernels

被引:4
作者
Schoonhoven, Richard Arnoud [1 ,2 ]
van Werkhoven, Ben [1 ,3 ]
Batenburg, Kees Joost [1 ,2 ]
机构
[1] Ctr Wiskunde & Informat, Computat Imaging Grp, NL-1098 XG Amsterdam, Netherlands
[2] Leiden Univ, Leiden Inst Adv Comp Sci, NL-2311 EZ Leiden, Netherlands
[3] Netherlands eSci Ctr, NL-1098 XH Amsterdam, Netherlands
基金
荷兰研究理事会;
关键词
Auto-tuning; evolutionary computing; fitness landscape analysis; graphical processing unit (GPU) computing; performance optimization; GLOBAL OPTIMIZATION; SEARCH; IMPLEMENTATION; MODELS;
D O I
10.1109/TEVC.2022.3210654
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed phenomenal growth in the application, and capabilities of graphical processing units (GPUs) due to their high parallel computation power at relatively low cost. However, writing a computationally efficient GPU program (kernel) is challenging and, generally, only certain specific kernel configurations lead to significant increases in performance. Auto-tuning is the process of automatically optimizing software for highly efficient execution on a target hardware platform. Auto-tuning is particularly useful for GPU programming, as a single kernel requires retuning after code changes, for different input data, and for different architectures. However, the discrete and nonconvex nature of the search space creates a challenging optimization problem. In this work, we investigate which algorithm produces the fastest kernels if the time-budget for the tuning task is varied. We conduct a survey by performing experiments on 26 different kernel spaces, from nine different GPUs, for 16 different evolutionary black-box optimization algorithms. We then analyze these results and introduce a novel metric based on the PageRank centrality concept as a tool for gaining insight into the difficulty of the optimization problem. We demonstrate that our metric correlates strongly with the observed tuning performance.
引用
收藏
页码:550 / 564
页数:15
相关论文
共 50 条
  • [31] Auto-tuning elastic applications in production
    Sampaio, Adalberto R., Jr.
    Beschastnikh, Ivan
    Maier, Daryl
    Bourne, Don
    Sundaresen, Vijay
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE, ICSE-SEIP, 2023, : 355 - 367
  • [32] AutoPas: Auto-Tuning for Particle Simulations
    Gratl, Fabio
    Seckler, Steffen
    Tchipev, Nikola
    Bungartz, Hans-Joachim
    Neumann, Philipp
    2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 748 - 757
  • [33] Auto-Tuning of Raw Filters for FPGAs
    Hahn, Tobias
    Wildermann, Stefan
    Teich, Jurgen
    2022 32ND INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2022, : 167 - 175
  • [34] Receding Horizon Optimization Approach to PID Controller Parameters Auto-tuning
    XU Min LI ShaoYuan CAI WenJian Institute of AutomationShanghai Jiaotong UniversityShanghai School of Electrical and Electronic EngineeringNanyang Technological UniversitySingapore
    自动化学报, 2005, (03) : 129 - 133
  • [35] Auto-tuning ejector for refrigeration system
    Wang, Lei
    Liu, Jiapeng
    Zou, Tao
    Du, Jingwei
    Jia, Fengze
    ENERGY, 2018, 161 : 536 - 543
  • [36] A Note on Auto-tuning GEMM for GPUs
    Li, Yinan
    Dongarra, Jack
    Tomov, Stanimire
    COMPUTATIONAL SCIENCE - ICCS 2009, PART I, 2009, 5544 : 884 - 892
  • [37] PI and PID auto-tuning procedure based on simplified single parameter optimization
    Ariel Romero, Julio
    Sanchis, Roberto
    Balaguer, Pedro
    JOURNAL OF PROCESS CONTROL, 2011, 21 (06) : 840 - 851
  • [38] Auto-tuning Multi-GPU High-Fidelity Numerical Simulations for Urban Air Mobility
    Koliogeorgi, Konstantina
    Anagnostopoulos, George
    Zampino, Gerardo
    Sanchis, Marcial
    Vinuesa, Ricardo
    Xydis, Sotirios
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [39] Exploiting Subgraph Similarities for Efficient Auto-tuning of Tensor Programs
    Li, Mingzhen
    Yang, Hailong
    Zhang, Shanjun
    Yu, Fengwei
    Gong, Ruihao
    Liu, Yi
    Luan, Zhongzhi
    Qian, Depei
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 786 - 796
  • [40] An improved auto-tuning scheme for PI controllers
    Mudi, Rajani K.
    Dey, Chanchal
    Lee, Tsu-Tian
    ISA TRANSACTIONS, 2008, 47 (01) : 45 - 52