Analysis and Design of a Context Adaptable SAD/MSE Architecture

被引:1
作者
Sudarsanam, Arvind [1 ]
Dasu, Aravind [1 ]
Vaithianathan, Karthik [2 ]
机构
[1] Utah State Univ, Reconfigurable Comp Grp, Old Main Hill,UMC-4120, Logan, UT 84321 USA
[2] Intel Corp, Visual Comp Grp, Hillsboro, OR 97124 USA
关键词
D O I
10.1155/2009/789592
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Design of flexible multimedia accelerators that can cater to multiple algorithms is being aggressively pursued in the media processors community. Such an approach is justified in the era of sub-45 nmtechnology where an increasingly dominating leakage power component is forcing designers to make the best possible use of on-chip resources. In this paper we present an analysis of two commonly used window-based operations (sum of absolute differences and mean squared error) across a variety of search patterns and block sizes (2 x 3, 5 x 5, etc.). We propose a context adaptable architecture that has (i) configurable 2D systolic array and (ii) 2D Configurable Register Array (CRA). CRA can cater to variable pixel access patterns while reusing fetched pixels across search windows. Benefits of proposed architecture when compared to 15 other published architectures are adaptability, high throughput, and low latency at a cost of increased footprint, when ported on a Xilinx FPGA. Copyright (C) 2009 Arvind Sudarsanam et al.
引用
收藏
页数:21
相关论文
共 20 条
[1]  
[Anonymous], 2007, XILINX ISE 10 1 MANU
[2]  
Byun M., 2005, P INT C IM PROC ICIP, V2, P1018
[3]   Analysis and architecture design of variable block-size motion estimation for H.264/AVC [J].
Chen, CY ;
Chien, SY ;
Huang, YW ;
Chen, TC ;
Wang, TC ;
Chen, LG .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2006, 53 (03) :578-593
[4]   PARAMETERIZABLE VLSI ARCHITECTURES FOR THE FULL-SEARCH BLOCK-MATCHING ALGORITHM [J].
DEVOS, L ;
STEGHERR, M .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (10) :1309-1316
[5]  
Kim NS, 2003, COMPUTER, V36, P68, DOI 10.1109/MC.2003.1250885
[6]   Frame-level pipelined motion estimation array processor [J].
Kittitornkun, S ;
Hu, YH .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (02) :248-251
[7]   ARRAY ARCHITECTURES FOR BLOCK MATCHING ALGORITHMS [J].
KOMAREK, T ;
PIRSCH, P .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (10) :1301-1308
[8]  
Kung H., INTRO VLSI SYSTEMS
[9]   A data-interlacing architecture with two-dimensional data-reuse for full-search block-matching algorithm [J].
Lai, YK ;
Chen, LG .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (02) :124-127
[10]   An efficient VLSI architecture for H.264 variable block size motion estimation [J].
Ou, CM ;
Le, CF ;
Hwang, WJ .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2005, 51 (04) :1291-1299