A Multi-GPU Framework for In-Memory Text Data Analytics

被引:4
作者
Chong, Poh Kit [1 ]
Karuppiah, Ettikan K. [1 ]
Yong, Keh Kok [1 ]
机构
[1] MIMOS Berhad, Kuala Lumpur, Malaysia
来源
2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA) | 2013年
关键词
GPU; text processing; sorting; matching; inmemory; text analysis; framework;
D O I
10.1109/WAINA.2013.238
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Current application of GPU processors for parallel computing tasks show excellent results in terms of speed-ups compared to CPU processors. However, there is no existing framework that enables automatic distribution of data and processing across multiple GPUs, modularity of kernel design, and efficient co-usage of CPU and GPU processors. All these elements are necessary conditions to enable users to easily perform 'Big Data' analysis, and to create their own modules for their desired processing functionality. We propose a framework for in-memory 'Big Text Data' analytics that provides mechanisms for automatic data segmentation, distribution, execution, and result retrieval across multiple cards (CPU, GPU & FPGA) and machines, and a modular design for easy addition of new GPU kernels. The architecture and components of the framework such as multi-card data distribution and execution, data structures for efficient memory access, algorithms for parallel GPU computation, and result retrieval are described in detail, and some of the kernels in the framework are evaluated using Big Data versus multi-core CPUs to demonstrate the performance and feasibility of using it for 'Big Data' analytics, providing alternative and cheaper HPC solution.
引用
收藏
页码:1411 / 1416
页数:6
相关论文
共 9 条
[1]  
[Anonymous], 2008, HKUSTCS0807
[2]  
[Anonymous], 2008, COMMUNICATIONS ACM
[3]  
Bakkum P., 2010, Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units - GPGPU '10 (New York, New York, USA, 2010), P94, DOI DOI 10.1145/1735688.1735706
[4]  
Gregg C, 2011, INT SYM PERFORM ANAL, P134, DOI 10.1109/ISPASS.2011.5762730
[5]   Mars: A MapReduce Framework on Graphics Processors [J].
He, Bingsheng ;
Fang, Wenbin ;
Luo, Qiong ;
Govindaraju, Naga K. ;
Wang, Tuyong .
PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2008, :260-269
[6]  
Holton G. A., VALUE RISK THEORY PR
[7]  
Munshi A., 2009, OPENCL SPECIFICATION
[8]  
Nvidia Corporation, 2011, NVIDIA CUDA C PROGR
[9]  
Wolfe M., 2010, ACM 3 WORKSH GEN PUR, P43