A Distributed OpenCL Framework using Redundant Computation and Data Replication

被引:0
|
作者
Kim, Junghyun [1 ]
Jo, Gangwon [1 ]
Jung, Jaehoon [1 ]
Kim, Jungwon [1 ]
Lee, Jaejin [1 ]
机构
[1] Seoul Natl Univ, Ctr Manycore Programming, Dept Comp Sci & Engn, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
OpenCL; clusters; heterogeneous computing; programming models; runtime systems; redundant computation; data replication; PERFORMANCE; EFFICIENT; PROGRAMS; CLUSTERS;
D O I
10.1145/2908080.2908094
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Applications written solely in OpenCL or CUDA cannot execute on a cluster as a whole. Most previous approaches that extend these programming models to clusters are based on a common idea: designating a centralized host node and coordinating the other nodes with the host for computation. However, the centralized host node is a serious performance bottleneck when the number of nodes is large. In this paper, we propose a scalable and distributed OpenCL framework called SnuCL-D for large-scale clusters. SnuCL-D's remote device virtualization provides an OpenCL application with an illusion that all compute devices in a cluster are confined in a single node. To reduce the amount of control-message and data communication between nodes, SnuCL-D replicates the OpenCL host program execution and data in each node. We also propose a new OpenCL host API function and a queueing optimization technique that significantly reduce the overhead incurred by the previous centralized approaches. To show the effectiveness of SnuCL-D, we evaluate SnuCL-D with a microbenchmark and eleven benchmark applications on a large-scale CPU cluster and a medium-scale GPU cluster.
引用
收藏
页码:553 / 569
页数:17
相关论文
共 50 条
  • [21] Exploring Pipe Implementations using an OpenCL Framework for FPGAs
    Mirian, Vincent
    Chow, Paul
    2015 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (FPT), 2015, : 112 - 119
  • [22] iMapReduce: A Distributed Computing Framework for Iterative Computation
    Zhang, Yanfeng
    Gao, Qixin
    Gao, Lixin
    Wang, Cuirong
    JOURNAL OF GRID COMPUTING, 2012, 10 (01) : 47 - 68
  • [23] Data service: Distributed data capture and replication
    Warner, Phillip B.
    Pietrowicz, Stephen R.
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVI, 2007, 376 : 475 - +
  • [24] iMapReduce: A Distributed Computing Framework for Iterative Computation
    Yanfeng Zhang
    Qixin Gao
    Lixin Gao
    Cuirong Wang
    Journal of Grid Computing, 2012, 10 : 47 - 68
  • [25] A framework for replication in data grid
    Bsoul, Mohammad
    2011 International Conference on Networking, Sensing and Control, ICNSC 2011, 2011, : 233 - 235
  • [26] Automatic and scalable data replication manager in distributed computation and storage infrastructure of Cyber-Physical Systems
    Yang Z.
    Bhimani J.
    Wang J.
    Evans D.
    Mi N.
    1600, West University of Timisoara (18): : 291 - 311
  • [27] Distributed sequential computing using mobile code: Moving computation to data
    Pan, L
    Bic, LF
    Dillencourt, MB
    PROCEEDINGS OF THE 2001 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2001, : 77 - 84
  • [28] Verifiable Local Computation on Distributed Data
    Zhang, Liang Feng
    Safavi-Naini, Reihaneh
    Liu, Xiao Wei
    SCC'14: PROCEEDINGS OF THE 2ND INTERNATIONAL WORKSHOP ON SECURITY IN CLOUD COMPUTING, 2014, : 3 - 10
  • [29] Data Replication for Distributed Graph Processing
    Ho, Li-Yung
    Wu, Jan-Jan
    Liu, Pangfeng
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 319 - 326
  • [30] Optimization algorithm of redundant data classification in distributed database
    Chen, Dong
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2018, 21 (02): : 439 - 444