Preliminary study on the automatic parallelism optimization model for image enhancement algorithms based on Intel's® Xeon Phi

被引:0
作者
Huang, Fang [1 ]
Yang, Hao [1 ]
Tao, Jian [2 ]
Wang, Jian [3 ]
Tan, Xicheng [4 ]
机构
[1] Univ Elect Sci & Technol China UESTC, Sch Recourses & Environm, Chengdu 611731, Peoples R China
[2] Texas A&M Univ, Texas A&M Engn Expt Stn TEES, College Stn, TX USA
[3] Chinese Acad Sci, Aerosp Informat Res Inst AIR, Beijing, Peoples R China
[4] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan, Peoples R China
基金
美国国家科学基金会; 国家重点研发计划;
关键词
automatic parallelism; image‐ enhancement algorithms; Par4All; Intel® Xeon Phi; unmanned aerial vehicles; IMPLEMENTATION; OPENCL;
D O I
10.1002/cpe.6260
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In unmanned aerial vehicle (UAV) image-processing applications, one needs to implement different parallel image-enhancement algorithms on several high-performance computing platforms utilizing various programming models. To speed up the parallelization procedure and improve its efficiency, the automatic parallel software package, Par4All, is applied in this work. We find that the performance of the original automatic parallelization algorithm produced with Par4All is inefficient. To resolve this problem, we propose different optimization approaches for Par4All based on Intel (R)'s Xeon Phi high-performance computing platform that are based on the structural features of the image-enhancement algorithms, which can further optimize the original parallel algorithm. These approaches mainly include: (1) Par4All automatic parallel search module optimization, (2) dynamic thread setting optimization, and (3) the collaborative parallelization of both CPU and many integrated core (MIC) processors. According to the results of the comparison experiments involving different algorithms, it is shown that the proposed optimization approaches for these kinds of algorithms can significantly improve the performance of automatic parallel algorithms. The acceleration ratio increases approximately by 30%, 70%, and 80% for the multiscale Retinex, Gaussian-filtering and median-filtering algorithms, respectively. As continuation and deepening of our previous research work, this research has the potential to be beneficial for other researchers in image-processing applications with image-enhancement algorithms.
引用
收藏
页数:14
相关论文
共 38 条
  • [1] Alyahya H., 2017, P INT C SMART CIT IN, P306
  • [2] Amini M., 2012, P 2 INT WORKSH POL C
  • [3] Amini M., 2012, P EMB WORLD C NUR GE
  • [4] Ashraf Muhammad Usman, 2016, International Journal of Modern Education and Computer Science, V8, P27, DOI 10.5815/ijmecs.2016.06.04
  • [5] Large-Scale Parallel Method of Moments on CPU/MIC Heterogeneous Clusters
    Chen, Yan
    Zuo, Sheng
    Zhang, Yu
    Zhao, Xunwang
    Zhang, Huanhuan
    [J]. IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2017, 65 (07) : 3782 - 3787
  • [6] Fang J., 2020, CCF T HIGH PERFORM C, V2, P382
  • [7] High Performance Computing of Fast Independent Component Analysis for Hyperspectral Image Dimensionality Reduction on MIC-based Clusters
    Fang, Minquan
    Yu, Yi
    Zhang, Weimin
    Wu, Heng
    Deng, Mingzhu
    Fang, Jianbin
    [J]. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, 2015, : 138 - 145
  • [8] Han T.D., 2009, Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units,, GPGPU-2, P52
  • [9] Source-to-Source Parallelization Compilers for Scientific Shared-Memory Multi-core and Accelerated Multiprocessing: Analysis, Pitfalls, Enhancement and Potential
    Hare, Re'em
    Mosseri, Idan
    Levin, Harel
    Alon, Lee-or
    Rusanovsky, Matan
    Oren, Gal
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2020, 48 (01) : 1 - 31
  • [10] Heinecke A., 2011, P EUR C PAR PROC BOR, P375