A Hardware-Efficient Block Matching Algorithm and Its Hardware Design for Variable Block Size Motion Estimation in Ultra-High-Definition Video Encoding

Cited by: 9
Authors
Zheng, Jianwei [1, 2]
Lu, Chao [3]
Guo, Jiefeng [1]
Chen, Deming [2]
Guo, Donghui [1]
Affiliations
[1] Xiamen Univ, Dept Elect Engn, Xiamen 361005, Fujian, Peoples R China
[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL 61801 USA
[3] Southern Illinois Univ Carbondale, Dept Elect & Comp Engn, Carbondale, IL 62901 USA
Keywords
Motion estimation; video encoding; hardware architecture; memory organization; variable block size; hardware efficiency; VLSI architecture; search algorithm; termination algorithm; data reuse; memory; range
DOI
10.1145/3290408
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Variable block size motion estimation contributes greatly to efficient interframe encoding, but its high computational complexity and heavy memory access make it the most critical bottleneck in ultra-high-definition video encoding. This article presents a hardware-efficient block matching algorithm, together with an efficient hardware design, that reduces the computational complexity of motion estimation while sustaining steady coding performance for high-quality video encoding. A three-level memory organization is proposed to reduce the memory bandwidth requirement while supporting a predictive common search window. By applying multiple search strategies and early termination, the proposed design achieves 1.8 to 3.7 times higher hardware efficiency than other works. Furthermore, the three-level memory organization reduces on-chip memory by 96.5% and the off-chip bandwidth requirement by 39.4%. The corresponding power consumption is only 198 mW at the highest working frequency of 500 MHz. The proposed design is attractive for high-quality, low-power video encoding in real-time applications.
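For readers unfamiliar with the terms in the abstract, the following is a minimal sketch of generic block matching with early termination. It is not the algorithm proposed in the article: it assumes a plain full search with the sum of absolute differences (SAD) as the matching cost, and the names `sad`, `block_match`, and `search_range` are hypothetical, chosen only for illustration.

```python
def sad(cur, ref, bx, by, dx, dy, n, limit):
    """SAD between the n x n block of `cur` at (bx, by) and the block of
    `ref` displaced by (dx, dy). Stops early once the partial sum reaches
    `limit`, since such a candidate can no longer beat the best one."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
        if total >= limit:  # early termination, checked once per row
            return total
    return total


def block_match(cur, ref, bx, by, n, search_range):
    """Full search over displacements within +/- search_range.
    Returns ((dx, dy), best_sad) for the best-matching reference block."""
    height, width = len(ref), len(ref[0])
    best_mv, best_cost = (0, 0), float("inf")
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            # Skip candidates that fall outside the reference frame.
            if bx + dx < 0 or bx + dx + n > width:
                continue
            if by + dy < 0 or by + dy + n > height:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, n, best_cost)
            if cost < best_cost:
                best_mv, best_cost = (dx, dy), cost
    return best_mv, best_cost
```

Frames here are lists of rows of integer pixel values. A variable block size scheme would run this kind of search for several values of `n` (and for non-square partitions); how the article organizes that search and its memory accesses is not described in the abstract.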
Pages: 21