Region-of-interest based resource allocation for conversational video communication of H.264/AVC

被引:125
作者
Liu, Yang [1 ]
Li, Zheng Guo [2 ]
Soh, Yeng Chai [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Ctr Modeling & Control Complex Syst, Singapore 639798, Singapore
[2] Inst Infocomm Res, Media Div, Singapore 119613, Singapore
关键词
human visual system (HVS); H.264/AVC; low bit-rate coding; real time video communication; region-of-interest (ROI);
D O I
10.1109/TCSVT.2007.913754
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to the complexity of H.264/AVC, it is very challenging to apply this standard to design a conversational video communication system. This problem is addressed in this paper by using region-of-interest (1101) based bit allocation and computational power allocation schemes. In our system, the ROI is first detected by using the direct frame difference and skin-tone information. Several coding parameters including quantization parameter, candidates for mode decision, the number of referencing frames, accuracy of motion vectors and the search range of motion estimation are adaptively adjusted at the macroblock (MB) level according to the relative importance of each MB. Subsequently, the encoder could allocate more resources such as bits and computational power to the ROI, and the decoding complexity is also optimized at the encoder side by utilizing an ROI based rate-distortion-complexity (R-D-C) cost function. The encoder is thus simplified and decoding-friendly, and the overall subjective visual quality can also be improved.
引用
收藏
页码:134 / 139
页数:6
相关论文
共 22 条
[1]  
Bojkovic Z, 2004, NEUREL 2004: SEVENTH SEMINAR ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING, PROCEEDINGS, P67
[2]  
CHI MC, 2004, P IEEE INT S CIRC SY
[3]  
DAI Q, 2004, P IEEE INT C IM PROC, P1123
[4]   Face-texture model based on SGLD and its application in face detection in a color scene [J].
Dai, Y ;
Nakano, Y .
PATTERN RECOGNITION, 1996, 29 (06) :1007-1017
[5]   Low bit-rate coding of image sequences using adaptive regions of interest [J].
Doulamis, N ;
Doulamis, A ;
Kalogeras, D ;
Kollias, S .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (08) :928-934
[6]   Power-rate-distortion analysis for wireless video communication under energy constraints [J].
He, ZH ;
Liang, YF ;
Chen, LL ;
Ahmad, I ;
Wu, DP .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (05) :645-658
[7]   Face detection in color images [J].
Hsu, RL ;
Abdel-Mottaleb, M ;
Jain, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :696-706
[8]   Complexity of optimized H.26L video decoder implementation [J].
Lappalainen, V ;
Hallapuro, A ;
Hämäläinen, TD .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (07) :717-725
[9]  
Lee JS, 2002, MAR BIOTECHNOL, V4, P1, DOI 10.1007/s10126-001-0077-3
[10]   Adaptive rate control for H.264 [J].
Li, Z. G. ;
Gao, W. ;
Pan, F. ;
Ma, S. W. ;
Lim, K. P. ;
Feng, G. N. ;
Lin, X. ;
Rahardja, S. ;
Lu, H. Q. ;
Lu, Y. .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2006, 17 (02) :376-406