Performance Improvement of Stencil Computations for Multi-core Architectures based on Machine Learning

Cited by: 7
Authors
Martinez, Victor [1]
Dupros, Fabrice [2]
Castro, Marcio [3]
Navaux, Philippe [1]
Affiliations
[1] Fed Univ Rio Grande do Sul UFRGS, Informat Inst INF, Porto Alegre, RS, Brazil
[2] Bur Rech Geol & Minieres, Orleans, France
[3] Fed Univ Santa Catarina UFSC, Dept Informat & Stat INE, Florianopolis, SC, Brazil
Source
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017) | 2017 / Vol. 108
Keywords
machine learning; stencil computation; multi-core; performance model
DOI
10.1016/j.procs.2017.05.164
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Stencil computations form the basis for solving many problems involving Partial Differential Equations (PDEs). Obtaining the best performance from such numerical kernels is a major challenge, as many critical parameters (architectural features, compiler flags, memory policies, multithreading strategies) must be finely tuned. In this context, auto-tuning methods have been used extensively to improve overall performance. However, the complexity of current architectures and the large number of optimizations to consider reduce the efficiency of this approach. This paper focuses on the use of Machine Learning to predict the performance of stencil kernels on multi-core architectures. Low-level hardware counters (e.g. cache misses and TLB misses) from a limited number of executions are used to build our predictive model. We consider two different kernels (7-point Jacobi and seismic wave modelling) to demonstrate the effectiveness of our approach. Our results show that performance can be predicted and that the best input configuration for stencil problems can be obtained by simulations of hardware counters and performance measurements. (C) 2017 The Authors. Published by Elsevier B.V. Peer-review under responsibility of the scientific committee of the International Conference on Computational Science.
Pages: 305-314
Number of pages: 10
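For readers unfamiliar with the 7-point Jacobi kernel named in the abstract, the following is a minimal sketch of one sweep over a 3D grid. It is an illustration only, not the authors' implementation: the function name `jacobi_7pt`, the nested-list grid layout, and the equal weighting of the centre cell and its six face neighbours are all assumptions made here.

```python
def jacobi_7pt(grid):
    """One Jacobi sweep over a 3D grid stored as nested lists [z][y][x].

    Assumption: every interior cell becomes the unweighted mean of itself
    and its six face neighbours; real kernels often use other coefficients.
    """
    nz, ny, nx = len(grid), len(grid[0]), len(grid[0][0])
    # Copy the grid so boundary cells carry over unchanged and every
    # read below comes from the old values (Jacobi, not Gauss-Seidel).
    new = [[row[:] for row in plane] for plane in grid]
    for k in range(1, nz - 1):
        for j in range(1, ny - 1):
            for i in range(1, nx - 1):
                new[k][j][i] = (
                    grid[k][j][i]
                    + grid[k - 1][j][i] + grid[k + 1][j][i]
                    + grid[k][j - 1][i] + grid[k][j + 1][i]
                    + grid[k][j][i - 1] + grid[k][j][i + 1]
                ) / 7.0
    return new
```

Each output cell reads seven input cells spread across three planes, which is why cache and TLB behaviour, the counters the abstract mentions, tend to dominate the cost of such kernels.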