An OpenCL Framework for Heterogeneous Multicores with Local Memory

被引:0
|
作者
Lee, Jaejin [1 ]
Kim, Jungwon [1 ]
Seo, Sangmin [1 ]
Kim, Seungkyun [1 ]
Park, Jungho [1 ]
Kim, Honggyu [1 ]
Thanh Tuan Dao [1 ]
Cho, Yongjin [1 ]
Seo, Sung Jong
Lee, Seung Hak
Cho, Seung Mo
Song, Hyo Jung
Suh, Sang-Bum
Choi, Jong-Deok
机构
[1] Seoul Natl Univ, Sch Comp Sci & Engn, Seoul 151744, South Korea
来源
PACT 2010: PROCEEDINGS OF THE NINETEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES | 2010年
关键词
OpenCL; Compilers; Runtime; Software-managed caches; Memory consistency; Work-item coalescing; Preload-poststore buffering;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present the design and implementation of an Open Computing Language (OpenCL) framework that targets heterogeneous accelerator multicore architectures with local memory. The architecture consists of a general-purpose processor core and multiple accelerator cores that typically do not have any cache. Each accelerator core, instead, has a small internal local memory. Our OpenCL runtime is based on software-managed caches and coherence protocols that guarantee OpenCL memory consistency to overcome the limited size of the local memory. To boost performance, the runtime relies on three source-code transformation techniques, work-item coalescing, web-based variable expansion and preload-poststore buffering, performed by our OpenCL C source-to-source translator. Work-item coalescing is a procedure to serialize multiple SPMD-like tasks that execute concurrently in the presence of barriers and to sequentially run them on a single accelerator core. It requires the web-based variable expansion technique to allocate local memory for private variables. Preload-poststore buffering is a buffering technique that eliminates the overhead of software cache accesses. Together with work-item coalescing, it has a synergistic effect on boosting performance. We show the effectiveness of our OpenCL framework, evaluating its performance with a system that consists of two Cell BE processors. The experimental result shows that our approach is promising.
引用
收藏
页码:193 / 204
页数:12
相关论文
共 50 条
  • [31] Methods for Optimizing OpenCL Applications on Heterogeneous Multicore Architectures
    Chu, Slo-Li
    Hsiao, Chih-Chieh
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (06): : 2549 - 2562
  • [32] Fuzzy classification of OpenCL programs targeting heterogeneous systems
    Al-Zoubi, Ahmad
    Tatas, Konstantinos
    Kyriacou, Costas
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (05) : 7189 - 7202
  • [33] Heterogeneous acceleration of volumetric JPEG 2000 using OpenCL
    Cornelis, Jan G.
    Lemeire, Jan
    Bruylants, Tim
    Schelkens, Peter
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2017, 31 (03) : 229 - 245
  • [34] Parallel LDPC Decoding on a Heterogeneous Platform using OpenCL
    Hong, Jung-Hyun
    Park, Joo-Yul
    Chung, Ki-Seok
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (06): : 2648 - 2668
  • [35] FlinkCL: An OpenCL-Based In-Memory Computing Architecture on Heterogeneous CPU-GPU Clusters for Big Data
    Chen, Cen
    Li, Kenli
    Ouyang, Aijia
    Li, Keqin
    IEEE TRANSACTIONS ON COMPUTERS, 2018, 67 (12) : 1765 - 1779
  • [36] AdaMD: Adaptive Mapping and DVFS for Energy-Efficient Heterogeneous Multicores
    Basireddy, Karunakar R.
    Singh, Amit Kumar
    Al-Hashimi, Bashir M.
    Merrett, Geoff V.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (10) : 2206 - 2217
  • [37] Portable Mapping of Data Parallel Programs to OpenCL for Heterogeneous Systems
    Grewe, Dominik
    Wang, Zheng
    O'Boyle, Michael F. P.
    PROCEEDINGS OF THE 2013 IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO), 2013, : 161 - 170
  • [38] A Heterogeneous Implementation of the Sobel Edge Detection Filter Using OpenCL
    Sanida, Theodora
    Sideris, Argyrios
    Dasygenis, Minas
    2020 9TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2020,
  • [39] POSTER: Automated Code Acceleration Targeting Heterogeneous OpenCL Devices
    Riebler, Heinrich
    Vaz, Gavin
    Kenter, Tobias
    Plessl, Christian
    ACM SIGPLAN NOTICES, 2018, 53 (01) : 417 - 418
  • [40] POSTER: Automated Code Acceleration Targeting Heterogeneous OpenCL Devices
    Riebler H.
    Vaz G.
    Kenter T.
    Plessl C.
    ACM SIGPLAN Notices, 2018, 53 (01): : 417 - 418