An Efficient Implementation of H.264/AVC Integer Motion Estimation Algorithm on Coarse-grained Reconfigurable Computing System

被引:1
作者
Nguyen, Kiem-Hung [1 ]
Cao, Peng [1 ]
Wang, Xue-Xiang [1 ]
机构
[1] Southeast Univ, Natl ASIC Syst Engn Res Ctr, Nanjing, Jiangsu, Peoples R China
基金
国家高技术研究发展计划(863计划);
关键词
Reconfigurable Computing; REMUS; Coarse-grained Dynamically Reconfigurable Architecture; H264/AVC; Variable Block Size Integer Motion Estimation;
D O I
10.4304/jcp.8.3.594-604
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Variable block size integer motion estimation (VBS-IME) is one of several tools which contribute to H.264/AVC's excellent coding efficiency. However, its high computational complexity and huge memory access bandwidth make it difficult to implement. Therefore, a hardware accelerator is indispensable for full-search VBS-IME in real-time video encoding applications. To overcome some of the limitations of conventional microprocessors and fine-grained reconfigurable devices in the field of multimedia and communication baseband processing, we have proposed a coarse-grained dynamically reconfigurable computing system, called REMUS. The paper presents architecture and compiling flow proposed for REMUS system, and shows that it is possible to implement a high complexity application as H.264/AVC full-search VBS-IME algorithm with competitive performance on platform of REMUS system. Experimental results have proven that the REMUS system operating at 200 MHz can perform VBSIME at real-time speed for CIF/SDTV@30fps video sequences with two reference frames and maximum search range of [-16,15]/[-8,7]. The implementation, therefore, can apply for H.264/AVC encoder in mobile multimedia applications. REMUS system is designed and synthesized by using TSMC 65nm low power technology. The die size of REMUS is 23.7 mm(2). REMUS consumes about 194mW while working at 200MHz.
引用
收藏
页码:594 / 604
页数:11
相关论文
共 18 条
[1]  
Bondalapati Kiran, 1998, INT WORKSH FIELD PRO
[2]  
CHEN Z, 2002, FAST INTEGER PEL FRA
[3]  
Ebeling Carl, 1997, MAPPING APPL RAP CON
[4]  
Gajski D., 1992, HIGH LEVEL SYNTHESIS
[5]  
Huang Yu-Wen, 2003, P INT S CIRC SYST
[6]  
KUNG HT, 1980, P ACM SIGMOD INT C M, P105
[7]   SUCCESSIVE ELIMINATION ALGORITHM FOR MOTION ESTIMATION [J].
LI, W ;
SALARI, E .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1995, 4 (01) :105-107
[8]   Reconfigurable system-on-a-chip motion estimation architecture for multi-standard video coding [J].
Lu, L. ;
McCanny, J. V. ;
Sezer, S. .
IET COMPUTERS AND DIGITAL TECHNIQUES, 2010, 4 (05) :349-364
[9]  
Mei B., 2003, ADRES ARCHITECTURE T, P61, DOI DOI 10.1007/978-3-540-45234-8_7
[10]   An efficient VLSI architecture for H.264 variable block size motion estimation [J].
Ou, CM ;
Le, CF ;
Hwang, WJ .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2005, 51 (04) :1291-1299