SunwayImg: A Parallel Image Processing Library for the Sunway Many-Core Processor

被引:0
|
作者
Liu, Rui [1 ,2 ]
Liu, Yi [1 ,2 ]
Zhao, Meiting [1 ]
Song, Kaida [1 ]
Qian, Depei [1 ,2 ]
机构
[1] Beihang Univ, Sino German Joint Software Inst, Beijing 100191, Peoples R China
[2] State Key Lab Math Engn & Adv Comp, Wuxi 214000, Jiangsu, Peoples R China
来源
IEEE ACCESS | 2019年 / 7卷
基金
中国国家自然科学基金;
关键词
Image library; high performance computing; parallel computing; deep neural networks;
D O I
10.1109/ACCESS.2019.2939940
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many big data applications need to process massive images and videos, while the performance of image processing is far from reaching requirements. This paper proposes the SunwayImg, a parallel image processing library, to support image-related applications on the Sunway many-core processor as well as the Sunway TaihuLight supercomputer. The SunwayImg integrates three kinds of image algorithms: fundamental algorithms to support basic image operations on the Sunway processor, widely used image feature extraction algorithms and a typical neural network model DBN. In addition, to parallelize various kinds of image algorithms efficiently on the Sunway processor, we propose a three-tier parallelization strategy as well as fine-grained parallelization inside core-groups. Finally, we accomplish implementation of the SunwayImg and evaluate it on the Sunway TaihuLight supercomputer to verify its effectiveness and performance.
引用
收藏
页码:128555 / 128569
页数:15
相关论文
共 37 条
  • [21] GODSON-T: AN EFFICIENT MANY-CORE PROCESSOR EXPLORING THREAD-LEVEL PARALLELISM
    Fan, Dongrui
    Zhang, Hao
    Wang, Da
    Ye, Xiaochun
    Song, Fenglong
    Li, Guojie
    Sun, Ninghui
    IEEE MICRO, 2012, 32 (02) : 38 - 47
  • [22] Optimized Parallel Implementation of Face Detection Based on Embedded Heterogeneous Many-Core Architecture
    Gao, Fang
    Huang, Zhangqin
    Wang, Shulong
    Ji, Xinrong
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (07)
  • [23] A Scalable Parallel Partition Tridiagonal Solver for Many-Core and Low B/F Processors
    Mitsuda, Tatsuya
    Ono, Kenji
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 860 - 869
  • [24] Research and Optimization of the Winograd-Based Convolutional Algorithm on ShenWei-26010 Many-Core Processor
    Wu Z.
    Jin X.
    An H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (04): : 955 - 972
  • [25] A Real Time Micro-expression Detection System with LBP-TOP on a Many-core Processor
    Soh, Xin Rong
    Baskaran, Vishnu Monn
    Buhari, Adamu Muhammad
    Phan, Raphael C. -W.
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 309 - 315
  • [26] Task Parallel Framework and Its Application in Nested Parallel Algorithms on the SW26010 Many-core Platform
    Sun Q.
    Li L.-S.
    Zhao H.-T.
    Zhao H.
    Wu C.-M.
    Wu, Chang-Mao (changmaowu@foxmail.com), 1600, Chinese Academy of Sciences (32): : 2352 - 2364
  • [27] Fluid-film lubrication computing with many-core processors and graphics processing units
    Wang, Nenzi
    Chen, Hsin-Yi
    Chen, Yu-Wen
    ADVANCES IN MECHANICAL ENGINEERING, 2018, 10 (10)
  • [28] Parallel Optimization Study of 3-Dimensional Eulerian Hydrocode for Many-Core High Performance Computing
    Ji, Jung Hwan
    Park, Sang Won
    Lee, Min Hyung
    TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS B, 2018, 42 (12) : 787 - 796
  • [29] GPU based Parallel Image Processing Library for Embedded Systems
    Cavus, Mustafa
    Sumerkan, Hakki Doganer
    Simsek, Osman Seckin
    Hassan, Hasan
    Yaglikci, Abdullah Giray
    Ergin, Oguz
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS (VISAPP), VOL 1, 2014, : 234 - 241
  • [30] A practical parallel implementation for TDLMS image filter on multi-core processor
    Devrim Akgün
    Journal of Real-Time Image Processing, 2017, 13 : 249 - 260