An FPGA implementation of Information Theoretic Visual-Saliency System and Its Optimization

Cited by: 14
Authors
Bae, Sungmin [1]
Cho, Yong Cheol Peter [1]
Park, Sungho [1]
Irick, Kevin M. [1]
Jin, Yongseok [1]
Narayanan, Vijaykrishnan [1]
Affiliations
[1] Penn State Univ, Dept Comp Sci & Engn, Microsyst Design Lab MDL, University Pk, PA 16802 USA
Source
2011 IEEE 19TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM) | 2011
Keywords
ATTENTION;
DOI
10.1109/FCCM.2011.41
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Biological vision systems use saliency-based visual attention mechanisms to limit higher-level vision processing to the most visually salient subsets of an input image. Among the computational models that capture visual saliency in biological systems, the information-theoretic AIM (Attention based on Information Maximization) algorithm has been demonstrated to predict human gaze patterns better than other existing models. We present an FPGA-based implementation of this computationally intensive AIM algorithm to support embedded vision applications. Implemented on a Virtex-6 LX240T, our design processes about 4M pixels/sec for 25 basis functions with a 21 by 21 convolution kernel for each of the R, G, and B color channels. We also provide an optimization for controlling the trade-off between power consumption and latency, and performance comparisons with a GPU implementation.
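To give a sense of why the abstract calls the algorithm computationally intensive, the quoted figures can be turned into a rough arithmetic estimate. The sketch below is not from the paper: it assumes direct spatial convolution with one multiply-accumulate (MAC) per kernel tap, per basis function, per color channel, and it assumes the 4M pixels/sec rate counts output pixels. The function and constant names are illustrative only.

```python
# Back-of-envelope workload estimate from the figures quoted in the abstract.
# Assumptions (not stated in the abstract): direct spatial convolution, one
# MAC per kernel tap per basis function per channel, and a pixel rate that
# counts output pixels.

BASIS_FUNCTIONS = 25            # from the abstract
KERNEL_SIDE = 21                # 21 by 21 kernel, from the abstract
COLOR_CHANNELS = 3              # R, G, B
PIXELS_PER_SECOND = 4_000_000   # "about 4M pixels/sec"


def macs_per_pixel(basis: int, side: int, channels: int) -> int:
    """MACs needed to produce all basis responses for one pixel."""
    return basis * side * side * channels


def macs_per_second(per_pixel: int, pixel_rate: int) -> int:
    """Sustained MAC rate implied by a given pixel throughput."""
    return per_pixel * pixel_rate


per_pixel = macs_per_pixel(BASIS_FUNCTIONS, KERNEL_SIDE, COLOR_CHANNELS)
total = macs_per_second(per_pixel, PIXELS_PER_SECOND)

print(f"MACs per pixel:  {per_pixel}")
print(f"MACs per second: {total:.3e}")
```

Under these assumptions the convolution stage alone needs 33,075 MACs per pixel, or roughly 1.3 x 10^11 MACs per second at the quoted throughput. The actual design may use a different convolution strategy, so this is only an order-of-magnitude illustration.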
Pages: 41-48
Page count: 8