Automatic tuning to performance modelling of matrix polynomials on multicore and multi-GPU systems

被引：2

作者：

Boratto, Murilo ^{[1
]}

Alonso, Pedro ^{[2
]}

Gimenez, Domingo ^{[3
]}

Lastovetsky, Alexey ^{[4
]}

机构：

[1] Univ Estado Bahia, Nucleo Arquitetura Comp & Sistemas Operacionais, Salvador, BA, Brazil

[2] Univ Politecn Valencia, Dept Sistemas Informat & Comp, Valencia, Spain

[3] Univ Murcia, Dept Sistemas Informat, Murcia, Spain

[4] Univ Coll Dublin, Sch Comp Sci, Heterogeneous Comp Lab, Dublin, Ireland

来源：

JOURNAL OF SUPERCOMPUTING | 2017年 / 73卷 / 01期

关键词：

Automatic tuning; Matrix polynomials; Performance; Multicore; Multi-GPU;

D O I：

10.1007/s11227-016-1694-y

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Automatic tuning methodologies have been used in the design of routines in recent years. The goal of these methodologies is to develop routines which automatically adapt to the conditions of the underlying computational system so that efficient executions are obtained independently of the end-user experience. This paper aims to explore programming routines that can automatically be adapted to the computational system conditions thanks to these automatic tuning methodologies. In particular, we have worked on the evaluation of matrix polynomials on multicore and multi-GPU systems as a target application. This application is very useful for the computation of matrix functions like the sine or cosine but, at the same time, the application is very time consuming since the basic computational kernel, which is the matrix multiplication, is carried out many times. The use of all available resources within a node in an easy and efficient way is crucial for the end user.

引用

页码：227 / 239

页数：13

共 50 条

[21] Parallel Computing Model and Performance Prediction based on Multi-GPU Environments
Wang, Zhuowei
Xu, Xianbin
Zhao, Wuqing
2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTERS IN EDUCATION (ICFCE 2011), VOL I, 2011, : 309 - 312
[22] sputniPIC: an Implicit Particle-in-Cell Code for Multi-GPU Systems
Chien, Steven W. D.
Nylund, Jonas
Bengtsson, Gabriel
Peng, Ivy B.
Podobas, Artur
Markidis, Stefano
2020 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD 2020), 2020, : 149 - 156
[23] Scalable Framework for Mapping Streaming Applications onto Multi-GPU Systems
Huynh, Huynh Phung
Hagiescu, Andrei
Wong, Weng-Fai
Goh, Rick Siow Mong
ACM SIGPLAN NOTICES, 2012, 47 (08) : 1 - 10
[24] High Performance Single and Multi-GPU Acceleration for Diffuse Optical Tomography
Saikia, Manob Jyoti
Kanhirodan, Rajan
2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1320 - 1323
[25] Priority-Based PCIe Scheduling for Multi-Tenant Multi-GPU Systems
Li, Chen
Sun, Yifan
Jin, Lingling
Xu, Lingjie
Cao, Zheng
Fan, Pengfei
Kaeli, David
Ma, Sheng
Guo, Yang
Yang, Jun
IEEE COMPUTER ARCHITECTURE LETTERS, 2019, 18 (02) : 157 - 160
[26] Exploiting Adaptive Data Compression to Improve Performance and Energy-efficiency of Compute Workloads in Multi-GPU Systems
Tavana, Mohammad Khavari
Sun, Yifan
Agostini, Nicolas Bohm
Kaeli, David
2019 IEEE 33RD INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2019), 2019, : 664 - 674
[27] Multi-GPU performance optimization of a computational fluid dynamics code using OpenACC
Xue, Weicheng
Roy, Christoper J.
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (05):
[28] Simulation of Information Propagation over Complex Networks: Performance Studies on Multi-GPU
Jin, Jiangming
Turner, Stephen John
Lee, Bu-Sung
Zhong, Jianlong
He, Bingsheng
17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON DISTRIBUTED SIMULATION AND REAL TIME APPLICATIONS (DS-RT 2013), 2013, : 179 - 188
[29] Warp-Aware Adaptive Energy Efficiency Calibration for Multi-GPU Systems
Wang, Zhuowei
Song, Xiaoyu
Cheng, Lianglun
Wan, Hai
Zhao, Wuqing
Wang, Tao
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (05) : 1676 - 1690
[30] Efficient Multi-GPU Shared Memory via Automatic Optimization of Fine-Grained Transfers
Muthukrishnan, Harini
Nellans, David
Lustig, Daniel
Fessler, Jeffrey A.
Wenisch, Thomas F.
2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021), 2021, : 139 - 152

← 1 2 3 4 5 →