Optimizing image processing on multi-core CPUs with Intel parallel programming technologies

Cited by: 0
Authors
Cheong Ghil Kim
Jeom Goo Kim
Do Hyeon Lee
Affiliations
[1] Namseoul University, Department of Computer Science
[2] Namseoul University, IT Convergence Technology Research & Education Center
Source
Multimedia Tools and Applications | 2014 / Vol. 68
Keywords
Multi-core; Streaming SIMD extension; Threading building block; Sobel operator; Sub-word parallelism; Task-level parallelism; Multimedia;
DOI
Not available
Abstract
Rapid advances in computer hardware and the popularity of multimedia applications have made multi-core processors with sub-word parallelism instructions the dominant market trend in desktop PCs as well as high-end mobile devices. This paper presents an efficient parallel implementation of the 2D convolution algorithm, which demands high computing performance, on multi-core desktop PCs. 2D convolution is a representative compute-intensive algorithm in image and signal processing applications: it is accompanied by heavy memory access, while its computational complexity is relatively low. The purpose of this study is to explore the effectiveness of the Streaming SIMD (Single Instruction Multiple Data) Extensions (SSE) technology and the TBB (Threading Building Blocks) run-time library on Intel multi-core processors. Using both makes it possible to exploit the hardware features of a multi-core processor concurrently for data-level and task-level parallelism. For the performance evaluation, we implemented a 3 × 3 kernel-based convolution algorithm using SSE2 and TBB in different combinations and compared their processing speeds. The experimental results show that both technologies have a significant effect on performance and that processing speed improves greatly when the two are used together; for example, speedups of 6.2, 6.1, and 1.4 times over the implementations using only one of them are reported for the 256 × 256, 512 × 512, and 1024 × 1024 data sets, respectively.
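As a rough illustration of the data-level side of this approach, the following is a minimal sketch of a 3 × 3 Sobel filter (named in the keywords) that processes eight pixels per iteration with SSE2 intrinsics. It is not the authors' implementation: the function names (`sobel_pixel`, `sobel_rows`, `sobel_image`), the 8-bit row-major image layout, the `|Gx| + |Gy|` magnitude, and the `grain` parameter are all assumptions made for the example. To stay dependency-free, the row chunks are executed sequentially here; because each chunk writes a disjoint set of output rows, a task scheduler such as TBB's `tbb::parallel_for` over a `tbb::blocked_range<int>` could presumably dispatch the same `sobel_rows` calls across cores.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <cstdlib>
#if defined(__SSE2__)
#include <emmintrin.h>  // SSE2 intrinsics
#endif

using std::uint8_t;

// Scalar reference: |Gx| + |Gy| of the 3x3 Sobel operator, clamped to 255.
// Assumes an 8-bit, row-major image of width w and an interior pixel (x, y).
static uint8_t sobel_pixel(const uint8_t* src, int w, int x, int y) {
    const uint8_t* r0 = src + (y - 1) * w + x;  // row above
    const uint8_t* r1 = r0 + w;                 // current row
    const uint8_t* r2 = r1 + w;                 // row below
    int gx = (r0[1] + 2 * r1[1] + r2[1]) - (r0[-1] + 2 * r1[-1] + r2[-1]);
    int gy = (r2[-1] + 2 * r2[0] + r2[1]) - (r0[-1] + 2 * r0[0] + r0[1]);
    return static_cast<uint8_t>(std::min(std::abs(gx) + std::abs(gy), 255));
}

#if defined(__SSE2__)
// Load 8 consecutive bytes and widen them to eight 16-bit lanes.
static inline __m128i load8_as_i16(const uint8_t* p) {
    const __m128i bytes = _mm_loadl_epi64(reinterpret_cast<const __m128i*>(p));
    return _mm_unpacklo_epi8(bytes, _mm_setzero_si128());
}

// SSE2 has no 16-bit absolute-value instruction, so use max(v, -v).
static inline __m128i abs_i16(__m128i v) {
    return _mm_max_epi16(v, _mm_sub_epi16(_mm_setzero_si128(), v));
}

// a + 2*b + c on eight 16-bit lanes.
static inline __m128i weighted_sum(__m128i a, __m128i b, __m128i c) {
    return _mm_add_epi16(_mm_add_epi16(a, c), _mm_slli_epi16(b, 1));
}
#endif

// Filter interior rows [y0, y1). Border pixels are left untouched.
void sobel_rows(const uint8_t* src, uint8_t* dst, int w, int y0, int y1) {
    for (int y = y0; y < y1; ++y) {
        int x = 1;
#if defined(__SSE2__)
        // Sub-word parallelism: eight output pixels per iteration.
        for (; x + 8 <= w - 1; x += 8) {
            const uint8_t* r0 = src + (y - 1) * w + x;
            const uint8_t* r1 = r0 + w;
            const uint8_t* r2 = r1 + w;
            const __m128i tl = load8_as_i16(r0 - 1), tr = load8_as_i16(r0 + 1);
            const __m128i bl = load8_as_i16(r2 - 1), br = load8_as_i16(r2 + 1);
            const __m128i left   = weighted_sum(tl, load8_as_i16(r1 - 1), bl);
            const __m128i right  = weighted_sum(tr, load8_as_i16(r1 + 1), br);
            const __m128i top    = weighted_sum(tl, load8_as_i16(r0), tr);
            const __m128i bottom = weighted_sum(bl, load8_as_i16(r2), br);
            const __m128i gx = _mm_sub_epi16(right, left);
            const __m128i gy = _mm_sub_epi16(bottom, top);
            const __m128i mag = _mm_adds_epi16(abs_i16(gx), abs_i16(gy));
            // packus saturates each 16-bit lane to the range 0..255.
            _mm_storel_epi64(reinterpret_cast<__m128i*>(dst + y * w + x),
                             _mm_packus_epi16(mag, mag));
        }
#endif
        // Scalar tail (and full fallback when SSE2 is unavailable).
        for (; x < w - 1; ++x) dst[y * w + x] = sobel_pixel(src, w, x, y);
    }
}

// Split the interior rows into chunks of `grain` rows. The chunks are
// independent; they run sequentially here, but each call is the unit of
// work a task-level scheduler could run on a separate core.
void sobel_image(const uint8_t* src, uint8_t* dst, int w, int h, int grain) {
    assert(w >= 3 && h >= 3 && grain >= 1);
    for (int y0 = 1; y0 < h - 1; y0 += grain)
        sobel_rows(src, dst, w, y0, std::min(y0 + grain, h - 1));
}
```

Calling `sobel_image(src, dst, width, height, 16)` filters every interior pixel of a `width × height` buffer. Widening to 16-bit lanes keeps the intermediate sums (at most 4 × 255 per side) within range, which is why only eight rather than sixteen pixels fit in one 128-bit register in this sketch.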
Pages: 237-251
Page count: 14