CUDA-enabled Hadoop Cluster for Sparse Matrix Vector Multiplication

Cited by: 0

Authors:

Reza, Motahar [1]

Sinha, Aman [1]

Nag, Rajkumar [1]

Mohanty, Prasant [1]

Affiliation:

[1] Natl Inst Sci & Technol, Sch Comp Sci & Engn, High Performance Comp Lab, Berhampur 761008, Orissa, India

Source:

2015 IEEE 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION SYSTEMS (RETIS) | 2015

Keywords:

Hadoop; MapReduce; SpMV; CUDA; SCOO; GPGPU;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Compute Unified Device Architecture (CUDA) is an architecture and programming model that harnesses the compute power of Graphics Processing Units (GPUs) to perform general, non-graphical tasks in a massively parallel manner. Hadoop is an open-source software framework with its own file system, the Hadoop Distributed File System (HDFS), and its own programming model, MapReduce, which together store very large amounts of data and process them quickly in a distributed manner on a cluster of inexpensive hardware. This paper presents a model and implementation of a hybrid Hadoop-CUDA approach to high-performance Sparse Matrix Vector Multiplication (SpMV) on very large matrices. Hadoop splits the input matrix into smaller sub-matrices, stores them on individual data nodes, and then invokes the required CUDA kernels on the GPU-equipped cluster nodes. The SpMV computation itself is performed in CUDA. This implementation was observed to improve SpMV performance on very large matrices, with a speedup of around 1.4 compared to a non-Hadoop, single-GPU CUDA implementation.
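
The split-then-multiply idea described in the abstract can be illustrated with a small, single-process sketch. This is not the authors' implementation: plain Python stands in for both the Hadoop job and the CUDA kernel, the matrix is held in simple coordinate (COO) triples rather than the SCOO format named in the keywords, and the function names `split_rows`, `spmv_block`, and `spmv` are hypothetical.

```python
# Illustrative sketch only: plain Python stands in for Hadoop and CUDA.
# All function names here are hypothetical, not taken from the paper.
from collections import defaultdict


def split_rows(entries, n_rows, n_blocks):
    """Partition COO triples (row, col, value) into contiguous row blocks.

    Plays the role of Hadoop splitting the matrix into sub-matrices.
    """
    block_size = -(-n_rows // n_blocks)  # ceiling division
    blocks = defaultdict(list)
    for row, col, value in entries:
        blocks[row // block_size].append((row, col, value))
    return blocks


def spmv_block(block_entries, x):
    """Multiply one sub-matrix by x.

    In the paper this step runs as a CUDA kernel on a GPU node.
    """
    partial = defaultdict(float)
    for row, col, value in block_entries:
        partial[row] += value * x[col]
    return partial


def spmv(entries, n_rows, x, n_blocks=2):
    """Combine the per-block results into the full output vector."""
    y = [0.0] * n_rows
    for block_entries in split_rows(entries, n_rows, n_blocks).values():
        # Row blocks are disjoint, so each output row is written once.
        for row, total in spmv_block(block_entries, x).items():
            y[row] = total
    return y


# The 3x3 matrix [[2, 0, 1], [0, 3, 0], [4, 0, 5]] times the vector [1, 2, 3].
entries = [(0, 0, 2.0), (0, 2, 1.0), (1, 1, 3.0), (2, 0, 4.0), (2, 2, 5.0)]
x = [1.0, 2.0, 3.0]
print(spmv(entries, 3, x))  # prints [5.0, 6.0, 19.0]
```

Splitting by rows keeps each block's output rows disjoint, so the partial results can be merged without any summation across blocks. Whether the paper partitions by rows or in some other way is not stated in the abstract.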

Pages: 169-172

Page count: 4

