A Distributed OpenCL Framework using Redundant Computation and Data Replication

被引:0
|
作者
Kim, Junghyun [1 ]
Jo, Gangwon [1 ]
Jung, Jaehoon [1 ]
Kim, Jungwon [1 ]
Lee, Jaejin [1 ]
机构
[1] Seoul Natl Univ, Ctr Manycore Programming, Dept Comp Sci & Engn, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
OpenCL; clusters; heterogeneous computing; programming models; runtime systems; redundant computation; data replication; PERFORMANCE; EFFICIENT; PROGRAMS; CLUSTERS;
D O I
10.1145/2908080.2908094
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Applications written solely in OpenCL or CUDA cannot execute on a cluster as a whole. Most previous approaches that extend these programming models to clusters are based on a common idea: designating a centralized host node and coordinating the other nodes with the host for computation. However, the centralized host node is a serious performance bottleneck when the number of nodes is large. In this paper, we propose a scalable and distributed OpenCL framework called SnuCL-D for large-scale clusters. SnuCL-D's remote device virtualization provides an OpenCL application with an illusion that all compute devices in a cluster are confined in a single node. To reduce the amount of control-message and data communication between nodes, SnuCL-D replicates the OpenCL host program execution and data in each node. We also propose a new OpenCL host API function and a queueing optimization technique that significantly reduce the overhead incurred by the previous centralized approaches. To show the effectiveness of SnuCL-D, we evaluate SnuCL-D with a microbenchmark and eleven benchmark applications on a large-scale CPU cluster and a medium-scale GPU cluster.
引用
收藏
页码:553 / 569
页数:17
相关论文
共 50 条
  • [41] A Framework for Distributed Data Processing
    Kadir Febrer-Hernandez, Jose
    Herrera Semenets, Vitali
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS (CIARP 2019), 2019, 11896 : 566 - 574
  • [42] Efficient distributed skyline computation using dependency-based data partitioning
    Yin, Bo
    Zhou, Siwang
    Lin, Yaping
    Liu, Yonghe
    Hu, Yupeng
    JOURNAL OF SYSTEMS AND SOFTWARE, 2014, 93 : 69 - 83
  • [43] Data Integration Tasks on Heterogeneous Systems Using OpenCL
    Faber, Clayton J.
    Cabrera, Anthony M.
    Booker, Oronde
    Maayan, Gabe
    Chamberlain, Roger D.
    PROCEEDINGS OF THE INTERNATIONAL WORKSHOP ON OPENCL (IWOCL'19), 2019,
  • [44] Acceleration of Ultrasonic Data Compression Using OpenCL on GPU
    Wang, Boyang
    Govindan, Pramod
    Gonnot, Thomas
    Saniie, Jafar
    2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2015, : 305 - 309
  • [45] An Efficient Data Replication Algorithm for Distributed Systems
    Panda, Sanjaya Kumar
    Naik, Saswati
    INTERNATIONAL JOURNAL OF CLOUD APPLICATIONS AND COMPUTING, 2018, 8 (03) : 60 - 77
  • [46] Data Replication Schemes for a Distributed Storage Scenario
    El-Gorashi, Taisir E. H.
    Elmirghani, Jaafar M. H.
    2010 12TH INTERNATIONAL CONFERENCE ON TRANSPARENT OPTICAL NETWORKS (ICTON), 2011,
  • [47] Data replication in a distributed system: A performance study
    Hwang, SY
    Lee, KKS
    Chin, YH
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, 1996, 1134 : 708 - 717
  • [48] Replication Data Concepts For Distributed Database Systems
    Welekar, Rashmi
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 344 - 346
  • [49] A distributed redundant real-time data storage mechanism
    Huang, Wen-Jun, 1600, Shanghai Jiaotong University (48):
  • [50] Optimization algorithm of redundant data classification in distributed database scenarios
    Post-Doctoral Mobile Station of Clinical Medicine, Third Xiangya Hospital, Central South University, Changsha Hunan
    410013, China
    不详
    410013, China
    不详
    410128, China
    不详
    510006, China
    Boletin Tecnico, 16 (54-61):