Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems

被引:2
|
作者
Boratto, Murilo [1 ]
Alonso, Pedro [2 ]
Gimenez, Domingo [3 ]
Lastovetsky, Alexey [4 ]
机构
[1] Univ Estado Bahia, Nucleo Arquitetura Comp & Sistemas Operacionais, Salvador, BA, Brazil
[2] Univ Politecn Valencia, Dept Sistemas Informat & Comp, Valencia, Spain
[3] Univ Murcia, Dept Sistemas Informat, Murcia, Spain
[4] Univ Coll Dublin, Sch Comp Sci, Heterogeneous Comp Lab, Dublin, Ireland
来源
JOURNAL OF SUPERCOMPUTING | 2017年 / 73卷 / 01期
关键词
Automatic tuning; Matrix polynomials; Performance; Multicore; Multi-GPU;
D O I
10.1007/s11227-016-1694-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic tuning methodologies have been used in the design of routines in recent years. The goal of these methodologies is to develop routines which automatically adapt to the conditions of the underlying computational system so that efficient executions are obtained independently of the end-user experience. This paper aims to explore programming routines that can automatically be adapted to the computational system conditions thanks to these automatic tuning methodologies. In particular, we have worked on the evaluation of matrix polynomials on multicore and multi-GPU systems as a target application. This application is very useful for the computation of matrix functions like the sine or cosine but, at the same time, the application is very time consuming since the basic computational kernel, which is the matrix multiplication, is carried out many times. The use of all available resources within a node in an easy and efficient way is crucial for the end user.
引用
收藏
页码:227 / 239
页数:13
相关论文
共 50 条
  • [21] Parallel Computing Model and Performance Prediction based on Multi-GPU Environments
    Wang, Zhuowei
    Xu, Xianbin
    Zhao, Wuqing
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTERS IN EDUCATION (ICFCE 2011), VOL I, 2011, : 309 - 312
  • [22] sputniPIC: an Implicit Particle-in-Cell Code for Multi-GPU Systems
    Chien, Steven W. D.
    Nylund, Jonas
    Bengtsson, Gabriel
    Peng, Ivy B.
    Podobas, Artur
    Markidis, Stefano
    2020 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2020), 2020, : 149 - 156
  • [23] Scalable Framework for Mapping Streaming Applications onto Multi-GPU Systems
    Huynh, Huynh Phung
    Hagiescu, Andrei
    Wong, Weng-Fai
    Goh, Rick Siow Mong
    ACM SIGPLAN NOTICES, 2012, 47 (08) : 1 - 10
  • [24] High Performance Single and Multi-GPU Acceleration for Diffuse Optical Tomography
    Saikia, Manob Jyoti
    Kanhirodan, Rajan
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1320 - 1323
  • [25] Priority-Based PCIe Scheduling for Multi-Tenant Multi-GPU Systems
    Li, Chen
    Sun, Yifan
    Jin, Lingling
    Xu, Lingjie
    Cao, Zheng
    Fan, Pengfei
    Kaeli, David
    Ma, Sheng
    Guo, Yang
    Yang, Jun
    IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (02) : 157 - 160
  • [26] Exploiting Adaptive Data Compression to Improve Performance and Energy-efficiency of Compute Workloads in Multi-GPU Systems
    Tavana, Mohammad Khavari
    Sun, Yifan
    Agostini, Nicolas Bohm
    Kaeli, David
    2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 664 - 674
  • [27] Multi-GPU performance optimization of a computational fluid dynamics code using OpenACC
    Xue, Weicheng
    Roy, Christoper J.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
  • [28] Simulation of Information Propagation over Complex Networks: Performance Studies on Multi-GPU
    Jin, Jiangming
    Turner, Stephen John
    Lee, Bu-Sung
    Zhong, Jianlong
    He, Bingsheng
    17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT 2013), 2013, : 179 - 188
  • [29] Warp-Aware Adaptive Energy Efficiency Calibration for Multi-GPU Systems
    Wang, Zhuowei
    Song, Xiaoyu
    Cheng, Lianglun
    Wan, Hai
    Zhao, Wuqing
    Wang, Tao
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (05) : 1676 - 1690
  • [30] Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers
    Muthukrishnan, Harini
    Nellans, David
    Lustig, Daniel
    Fessler, Jeffrey A.
    Wenisch, Thomas F.
    2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 139 - 152