FPGA-Based Processor Acceleration for Image Processing Applications

被引:31
作者
Siddiqui, Fahad [1 ]
Amiri, Sam [2 ]
Minhas, Umar Ibrahim [1 ]
Deng, Tiantai [1 ]
Woods, Roger [1 ]
Rafferty, Karen [1 ]
Crookes, Daniel [1 ]
机构
[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, Antrim, North Ireland
[2] Coventry Univ, Sch Comp Elect & Maths, Coventry CV1 5FB, W Midlands, England
基金
英国工程与自然科学研究理事会;
关键词
FPGA; hardware acceleration; processor architectures; image processing; heterogeneous computing; INTELLIGENT; SYSTEMS; VISION;
D O I
10.3390/jimaging5010016
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
FPGA-based embedded image processing systems offer considerable computing resources but present programming challenges when compared to software systems. The paper describes an approach based on an FPGA-based soft processor called Image Processing Processor (IPPro) which can operate up to 337 MHz on a high-end Xilinx FPGA family and gives details of the dataflow-based programming environment. The approach is demonstrated for a k-means clustering operation and a traffic sign recognition application, both of which have been prototyped on an Avnet Zedboard that has Xilinx Zynq-7000 system-on-chip (SoC). A number of parallel dataflow mapping options were explored giving a speed-up of 8 times for the k-means clustering using 16 IPPro cores, and a speed-up of 9.6 times for the morphology filter operation of the traffic sign recognition using 16 IPPro cores compared to their equivalent ARM-based software implementations. We show that for k-means clustering, the 16 IPPro cores implementation is 57, 28 and 1.7 times more power efficient (fps/W) than ARM Cortex-A7 CPU, nVIDIA GeForce GTX980 GPU and ARM Mali-T628 embedded GPU respectively.
引用
收藏
页数:22
相关论文
共 29 条
[1]   FPGA-Based Soft-Core Processors for Image Processing Applications [J].
Amiri, Moslem ;
Siddiqui, Fahad Manzoor ;
Kelly, Colm ;
Woods, Roger ;
Rafferty, Karen ;
Bardak, Burak .
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 87 (01) :139-156
[2]  
Andryc K, 2013, PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), P230, DOI 10.1109/FPT.2013.6718358
[3]  
[Anonymous], 2013, 2013 INT C HARDW SOF, DOI DOI 10.1109/CODESISSS.2013.6658993
[4]  
[Anonymous], 2014, INVESTOR INFORM PACK
[5]  
[Anonymous], 2014, 2014 IEEE WORKSHOP S, DOI DOI 10.1109/SIPS.2014.6986093
[6]  
Bacon DF, 2013, COMMUN ACM, V56, P56, DOI 10.1145/2436256.2436271
[7]   Dataflow object detection system for FPGA-based smart camera [J].
Bourrasset, Cedric ;
Maggiani, Luca ;
Serot, Jocelyn ;
Berry, Francois .
IET CIRCUITS DEVICES & SYSTEMS, 2016, 10 (04) :280-291
[8]   State-of-the-art in heterogeneous computing [J].
Brodtkorb, Andre R. ;
Dyken, Christopher ;
Hagen, Trond R. ;
Hjelmervik, Jon M. ;
Storaasli, Olaf O. .
SCIENTIFIC PROGRAMMING, 2010, 18 (01) :1-33
[9]   The iDEA DSP Block-Based Soft Processor for FPGAs [J].
Cheah, Hui Yan ;
Brosser, Fredrik ;
Fahmy, Suhaib A. ;
Maskell, Douglas L. .
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2014, 7 (03)
[10]   PULP: A Ultra-Low Power Parallel Accelerator for Energy-Efficient and Flexible Embedded Vision [J].
Conti, Francesco ;
Rossi, Davide ;
Pullini, Antonio ;
Loi, Igor ;
Benini, Luca .
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 84 (03) :339-354