Survey on block matching motion estimation algorithms and architectures with new results

被引:102
作者
Huang, YW [1 ]
Chen, CY
Tsai, CH
Shen, CF
Chen, LG
机构
[1] Natl Taiwan Univ, Grad Inst Elect Engn, DSP IC Design Lab, Taipei 10617, Taiwan
[2] Natl Taiwan Univ, Dept Elect Engn, Taipei 10617, Taiwan
来源
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2006年 / 42卷 / 03期
关键词
block matching; motion estimation; global elimination algorithm; VLSI architecture;
D O I
10.1007/s11265-006-4190-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Block matching motion estimation is the heart of video coding systems. During the last two decades, hundreds of fast algorithms and VLSI architectures have been proposed. In this paper, we try to provide an extensive exploration of motion estimation with our new developments. The main concepts of fast algorithms can be classified into six categories: reduction in search positions, simplification of matching criterion, bitwidth reduction, predictive search, hierarchical search, and fast full search. Comparisons of various algorithms in terms of video quality and computational complexity are given as useful guidelines for software applications. As for hardware implementations, full search architectures derived from systolic mapping are first introduced. The systolic arrays can be divided into inter-type and intra-type with 1-D, 2-D, and tree structures. Hexagonal plots are presented for system designers to clearly evaluate the architectures in six aspects including gate count, required frequency, hard-ware utilization, memory bandwidth, memory bitwidth, and latency. Next, architectures supporting fast algorithms are also reviewed. Finally, we propose our algorithmic and architectural co-development. The main idea is quick checking of the entire search range with simplified matching criterion to globally eliminate impossible candidates, followed by finer selection among potential best matched candidates. The operations of the two stages are mapped to the same hardware for resource sharing. Simulation results show that our design is ten times more area-speed efficient than full search architectures while the video quality is competitively the same.
引用
收藏
页码:297 / 320
页数:24
相关论文
共 100 条
[1]  
[Anonymous], 144962 ISOIEC
[2]  
Bierling M., 1988, Proceedings of the SPIE - The International Society for Optical Engineering, V1001, P942, DOI 10.1117/12.969046
[3]   Fast full-search block matching [J].
Brünig, M ;
Niehsen, W .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (02) :241-247
[4]   Fast motion vector estimation using multiresolution-spatio-temporal correlations [J].
Chalidabhongse, J ;
Kuo, CCJ .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1997, 7 (03) :477-488
[5]   SCALABLE ARRAY ARCHITECTURE DESIGN FOR FULL SEARCH BLOCK MATCHING [J].
CHANG, SF ;
HWANG, JH ;
JEN, CW .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1995, 5 (04) :332-343
[6]  
Chao WM, 2002, 2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, PROCEEDINGS, P492
[7]  
Chao WM, 2003, PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, P788
[8]  
CHEN CY, 2004, UNPUB IEEE T CIRCUIT, V1
[9]   An Efficient Parallel Motion Estimation Algorithm for Digital Image Processing [J].
Chen, Liang-Gee ;
Chen, Wai-Ting ;
Jehng, Yeu-Shen ;
Chiueh, Tzi-Dar .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1991, 1 (04) :378-385
[10]  
CHEN MC, 1995, P IEEE MULT COMM VID