CUDA-enabled Hadoop Cluster for Sparse Matrix Vector Multiplication

被引:0
|
作者
Reza, Motahar [1 ]
Sinha, Aman [1 ]
Nag, Rajkumar [1 ]
Mohanty, Prasant [1 ]
机构
[1] Natl Inst Sci & Technol, Sch Comp Sci & Engn, High Performance Comp Lab, Berhampur 761008, Orissa, India
来源
2015 IEEE 2ND INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION SYSTEMS (RETIS) | 2015年
关键词
Hadoop; MapReduce; SpMV; CUDA; SCOO; GPGPU;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Compute Unified Device Architecture (CUDA) is an architecture and programming model that allows leveraging the high compute-intensive processing power of the Graphical Processing Units (GPUs) to perform general, non-graphical tasks in a massively parallel manner. Hadoop is an open-source software framework that has its own file system, the Hadoop Distributed File System (HDFS), and its own programming model, the Map Reduce, in order to accomplish the tasks of storage of very large amount of data and their fast processing in a distributed manner in a cluster of inexpensive hardware. This paper presents a model and implementation of a Hadoop-CUDA Hybrid approach to perform Sparse Matrix Vector Multiplication(SpMV) of very large matrices in a very high performing manner. Hadoop is used for splitting the input matrix into smaller sub-matrices, storing them on individual data nodes and then invoking the required CUDA kernels on the individual GPU-possessing cluster nodes. The original SpMV is done using CUDA. Such an implementation has been seen to improve the performance of the SpMV operation over very large matrices by speedup of around 1.4 in comparison to non-Hadoop, single-GPU CUDA implementation.
引用
收藏
页码:169 / 172
页数:4
相关论文
共 50 条
  • [41] Adaptive sparse matrix representation for efficient matrix-vector multiplication
    Zardoshti, Pantea
    Khunjush, Farshad
    Sarbazi-Azad, Hamid
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (09) : 3366 - 3386
  • [42] Correlation acceleration in GNSS software receivers using a CUDA-enabled GPU
    Liangchun Xu
    Nesreen I. Ziedan
    Xiaoji Niu
    Wenfei Guo
    GPS Solutions, 2017, 21 : 225 - 236
  • [43] Parallelization of a Video Segmentation Algorithm on CUDA-Enabled Graphics Processing Units
    Gomez-Luna, Juan
    Maria Gonzalez-Linares, Jose
    Ignacio Benavides, Jose
    Guil, Nicolas
    EURO-PAR 2009: PARALLEL PROCESSING, PROCEEDINGS, 2009, 5704 : 924 - +
  • [44] Correlation acceleration in GNSS software receivers using a CUDA-enabled GPU
    Xu, Liangchun
    Ziedan, Nesreen I.
    Niu, Xiaoji
    Guo, Wenfei
    GPS SOLUTIONS, 2017, 21 (01) : 225 - 236
  • [45] CUDA-enabled implementation of a neural network algorithm for handwritten digit recognition
    Izotov P.Y.
    Kazanskiy N.L.
    Golovashkin D.L.
    Sukhanov S.V.
    Optical Memory and Neural Networks, 2011, 20 (2) : 98 - 106
  • [46] Accelerating universal Kriging interpolation algorithm using CUDA-enabled GPU
    Cheng, Tangpei
    COMPUTERS & GEOSCIENCES, 2013, 54 : 178 - 183
  • [47] Auto-tuning of Sparse Matrix-Vector Multiplication on Graphics Processors
    Abu-Sufah, Walid
    Karim, Asma Abdel
    SUPERCOMPUTING (ISC 2013), 2013, 7905 : 151 - 164
  • [48] Parallel Sparse Matrix-Vector Multiplication Using Accelerators
    Maeda, Hiroshi
    Takahashi, Daisuke
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II, 2016, 9787 : 3 - 18
  • [49] A New Method of Sparse Matrix-Vector Multiplication on GPU
    Huan, Gao
    Qian, Zhang
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 954 - 958
  • [50] Implementing Sparse Matrix-Vector Multiplication with QCSR on GPU
    Zhang, Jilin
    Liu, Enyi
    Wan, Jian
    Ren, Yongjian
    Yue, Miao
    Wang, Jue
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 473 - 482