Effect of scaling on the area and performance of the H.264/AVC full-search fractional motion estimation algorithm on field-programmable gate arrays

被引:0
作者
Vasiljevic, J. [1 ]
Ye, A. G. [1 ]
机构
[1] Ryerson Univ, Dept Elect & Comp Engn, Toronto, ON M5B 2K3, Canada
关键词
ARCHITECTURE DESIGN; VLSI ARCHITECTURE;
D O I
10.1049/iet-cdt.2010.0167
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Fractional motion estimation (FME) is an important part of the H.264/AVC video encoding standard. The algorithm can significantly increase the compression ratio of video encoders while preserving high video quality. The full-search FME algorithm, however, is computationally expensive and can consist of over 45% of the total motion estimation process. To maximise the performance and efficiency of FME implementations on field-programmable gate arrays (FPGAs), one needs to efficiently exploit the inherent parallelism in the algorithm. The authors investigate the scalability of the full-search FME algorithm on FPGAs and also implemented six scaled versions of the algorithm on Xilinx Virtex-5 FPGAs. The authors found that scaling the algorithm vertically within a 4 x 4 sub-block is more efficient than scaling horizontally across several sub-blocks. It is shown that, with four reference frames, the best vertically scaled design can achieve 96 frames-per-second (fps) performance while encoding full 1920 x 1088 progressive HDTV video, and the design only consumes 25.5 K LUTS and 28.7 K registers.
引用
收藏
页码:95 / 104
页数:10
相关论文
共 24 条
[1]  
Chang JF, 2005, IEEE INT SYM MULTIM, P491
[2]  
Chao W., 2003, P INT S CIRC SYST BA, pII788
[3]   Analysis and architecture design of variable block-size motion estimation for H.264/AVC [J].
Chen, CY ;
Chien, SY ;
Huang, YW ;
Chen, TC ;
Wang, TC ;
Chen, LG .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2006, 53 (03) :578-593
[4]  
Chen TC, 2004, 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 2, PROCEEDINGS, P273
[5]  
Chen TC, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS, P9
[6]  
Chen TC, 2006, ASIA S PACIF DES AUT, P750
[7]   Analysis and architecture design of an HDTV720p 30 frames/s H.264/AVC encoder [J].
Chen, Tung-Chien ;
Chien, Shao-Yi ;
Huang, Yu-Wen ;
Tsai, Chen-Han ;
Chen, Ching-Yeh ;
Chen, To-Wei ;
Chen, Liang-Gee .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006, 16 (06) :673-688
[8]  
Chen Y., 2007, P PICT COD S LISB PO, P13
[9]   VLSI Architecture Design of Fractional Motion Estimation for H.264/AVC [J].
Chen, Yi-Hau ;
Chen, Tung-Chien ;
Chien, Shao-Yi ;
Huang, Yu-Wen ;
Chen, Liang-Gee .
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2008, 53 (03) :335-347
[10]   Fast integer-pel and fractional-pel motion estimation for H.264/AVC [J].
Chen, Zhibo ;
Xu, Jianfeng ;
He, Yun ;
Zheng, Junli .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2006, 17 (02) :264-290