Energy cost evaluation of parallel algorithms for multiprocessor systems

被引:11
作者
Wang, Zhuowei [1 ]
Xu, Xianbin [1 ]
Xiong, Naixue [2 ]
Yang, Laurence T. [3 ]
Zhao, Wuqing [1 ]
机构
[1] Wuhan Univ, Sch Comp, Wuhan 430000, Peoples R China
[2] Georgia State Univ, Dept Comp Sci, Atlanta, GA 30303 USA
[3] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS B2G 1C0, Canada
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2013年 / 16卷 / 01期
关键词
GPUs; Parallel algorithms; Energy scalability; Energy conservation; Performance; GPU;
D O I
10.1007/s10586-011-0188-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the continuous development of hardware and software, Graphics Processor Units (GPUs) have been used in the general-purpose computation field. They have emerged as a computational accelerator that dramatically reduces the application execution time with CPUs. To achieve high computing performance, a GPU typically includes hundreds of computing units. The high density of computing resource on a chip brings in high power consumption. Therefore power consumption has become one of the most important problems for the development of GPUs. This paper analyzes the energy consumption of parallel algorithms executed in GPUs and provides a method to evaluate the energy scalability for parallel algorithms. Then the parallel prefix sum is analyzed to illustrate the method for the energy conservation, and the energy scalability is experimentally evaluated using Sparse Matrix-Vector Multiply (SpMV). The results show that the optimal number of blocks, memory choice and task scheduling are the important keys to balance the performance and the energy consumption of GPUs.
引用
收藏
页码:77 / 90
页数:14
相关论文
共 34 条
[11]  
BUCK I, 2004, P 1 ACM WORKSH GEN P, pC20
[12]   Design issues for dynamic voltage scaling [J].
Burd, TD ;
Brodersen, RW .
ISLPED '00: PROCEEDINGS OF THE 2000 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2000, :9-14
[13]   LOW-POWER CMOS DIGITAL DESIGN [J].
CHANDRAKASAN, AP ;
SHENG, S ;
BRODERSEN, RW .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1992, 27 (04) :473-484
[14]  
CHATTERJEE S, 1990, SUPERCOMPUTING 90, P666
[15]  
CUDDP, 2011, CUDA DAT PAR PRIM LI
[16]  
Dotsenko Y, 2008, ICS'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, P205
[17]  
Fatahaian K., 2004, P 19 EUR SIGGRAPH GR
[18]   Exploring weak scalability for FEM calculations on a GPU-enhanced cluster [J].
Goeddeke, Dominik ;
Strzodka, Robert ;
Mohd-Yusof, Jamaludin ;
McCormick, Patrick ;
Buijssen, Sven H. M. ;
Grajewski, Matthias ;
Turek, Stefan .
PARALLEL COMPUTING, 2007, 33 (10-11) :685-699
[19]  
Govindaraju N., 2006, SC
[20]  
Harri M., 2007, GPU GEMS