共 2 条
Design of a Hybrid MPI-CUDA Benchmark Suite for CPU-GPU Clusters
被引:5
作者:
Agarwal, Tejaswi
[1
]
Becchi, Michela
[1
]
机构:
[1] Univ Missouri, Columbia, MO 65211 USA
来源:
PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14)
|
2014年
关键词:
Benchmark;
CUDA-MPI;
clusters;
GPU;
D O I:
10.1145/2628071.2671423
中图分类号:
TP3 [计算技术、计算机技术];
学科分类号:
0812 ;
摘要:
In the last few years, GPUs have become an integral part of HPC clusters. To test these heterogeneous CPU-GPU systems, we designed a hybrid CUDA-MPI benchmark suite that consists of three communication-and compute-intensive applications: Matrix Multiplication (MM), Needleman-Wunsch (NW) and the ADFA compression algorithm [1]. The main goal of this work is to characterize these workloads on CPU-GPU clusters. Our benchmark applications are designed to allow cluster administrators to identify bottlenecks in the cluster, to decide if scaling applications to multiple nodes would improve or decrease overall throughput and to design effective scheduling policies. Our experiments show that inter-node communication can significantly degrade the throughput of communication-intensive applications. We conclude that the scalability of the applications depends primarily on two factors: the cluster configuration and the applications characteristics.
引用
收藏
页码:505 / 506
页数:2
相关论文