HybridTuner: Tuning with Hybrid Derivative-Free Optimization Initialization Strategies

被引:0
作者
Sauk, Benjamin [1 ]
Sahinidis, Nikolaos V. [2 ,3 ]
机构
[1] Carnegie Mellon Univ, Dept Chem Engn, Pittsburgh, PA 15213 USA
[2] Georgia Inst Technol, H Milton Stewart Sch Ind & Syst Engn, Atlanta, GA 30332 USA
[3] Georgia Inst Technol, Sch Chem & Biomol Engn, Atlanta, GA 30332 USA
来源
LEARNING AND INTELLIGENT OPTIMIZATION, LION 15 | 2021年 / 12931卷
基金
美国国家科学基金会;
关键词
Autotuners; Derivative-free optimization; GPU computing; PATTERN SEARCH; ALGORITHMS; SOFTWARE; GEMM;
D O I
10.1007/978-3-030-92121-7_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To utilize the full potential of advanced computer architectures, algorithms often need to be tuned to the architecture being used. We propose two hybrid derivative-free optimization (DFO) methods to maximize the performance of an algorithm after evaluating a small number of possible algorithmic configurations. Our autotuner (a) reduces the execution time of dense matrix multiplication by a factor of 1.4x compared to state-of-the-art autotuners, (b) identifies high-quality tuning parameters within only 5% of the computational effort required by other autotuners and (c) can be applied to any computer architecture.
引用
收藏
页码:379 / 393
页数:15
相关论文
共 45 条
  • [41] XSEDE: Accelerating Scientific Discovery
    Towns, John
    Cockerill, Tim
    Dahan, Maytal
    Foster, Ian
    Gaither, Kelly
    Grimshaw, Andrew
    Hazlewood, Victor
    Lathrop, Scott
    Lifka, Dave
    Peterson, Gregory D.
    Roskies, Ralph
    Scott, J. Ray
    Wilkins-Diehr, Nancy
    [J]. COMPUTING IN SCIENCE & ENGINEERING, 2014, 16 (05) : 62 - 74
  • [42] A particle swarm pattern search method for bound constrained global optimization
    Vaz, A. Ismael F.
    Vicente, Luis N.
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2007, 39 (02) : 197 - 219
  • [43] OSKI: A library of automatically tuned sparse matrix kernels
    Vuduc, R
    Demmel, JW
    Yelick, KA
    [J]. SCIDAC 2005: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2005, 16 : 521 - 530
  • [44] Whaley RC, 2001, PARALLEL COMPUT, V27, P3, DOI 10.1016/S0167-8191(00)00087-9
  • [45] Yuki T., POLYBENCHC 4 2 1