Can search algorithms save large-scale automatic performance tuning?

被引:20
作者
Balaprakash, Prasanna [1 ]
Wild, Stefan M. [1 ]
Hovland, Paul D. [1 ]
机构
[1] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
来源
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS) | 2011年 / 4卷
关键词
autotuning; empirical tuning; optimization; performance-tuning; OPTIMIZATION;
D O I
10.1016/j.procs.2011.04.234
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Empirical performance optimization of computer codes using autotuners has received significant attention in recent years. Given the increased complexity of computer architectures and scientific codes, evaluating all possible code variants is prohibitively expensive for all but the simplest kernels. One way for autotuners to overcome this hurdle is through use of a search algorithm that finds high-performing code variants while examining relatively few variants. In this paper we examine the search problem in autotuning from a mathematical optimization perspective. As an illustration of the power and limitations of this optimization, we conduct an experimental study of several optimization algorithms on a number of linear algebra kernel codes. We find that the algorithms considered obtain performance gains similar to the optimal ones found by complete enumeration or by large random searches but in a tiny fraction of the computation time.
引用
收藏
页码:2136 / 2145
页数:10
相关论文
共 22 条
  • [11] OPTIMIZATION BY DIRECT SEARCH IN MATRIX COMPUTATIONS
    HIGHAM, NJ
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1993, 14 (02) : 317 - 333
  • [12] ParamILS: An Automatic Algorithm Configuration Framework
    Hutter, Frank
    Hoos, Holger H.
    Leyton-Brown, Kevin
    Stuetzle, Thomas
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 36 : 267 - 306
  • [13] BENCHMARKING DERIVATIVE-FREE OPTIMIZATION ALGORITHMS
    More, Jorge J.
    Wild, Stefan M.
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2009, 20 (01) : 172 - 191
  • [14] Norris B, 2008, CH CRC COMP SCI SER, P443
  • [15] Norris B, 2009, LECT NOTES COMPUT SC, V5544, P248, DOI 10.1007/978-3-642-01970-8_25
  • [16] Automatic tuning of whole applications using direct search and a performance-based transformation system
    Qasem, Apan
    Kennedy, Ken
    Mellor-Crummey, John
    [J]. JOURNAL OF SUPERCOMPUTING, 2006, 36 (02) : 183 - 196
  • [17] A Comparison of Search Heuristics for Empirical Code Optimization
    Seymour, Keith
    You, Haihang
    Dongarra, Jack
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, 2008, : 421 - 429
  • [18] Shin J., 2009, P 4 INT WORKSH AUT P
  • [19] Tabatabaee V., 2005, P 2005 ACM IEEE C SU
  • [20] Tiwari A., 2009, MILITARY COMMUNICATI, P1