An epipolar geometry-based fast disparity estimation algorithm for multiview image and video coding

被引:40
作者
Lu, Jiangbo [1 ]
Cai, Hua
Lou, Jian-Guang
Li, Jiang
机构
[1] Univ Louvain, Dept Elect Engn, B-3001 Louvain, Belgium
[2] IMEC, Multimedia Grp, B-3001 Louvain, Belgium
[3] Microsoft Res Asia, Media Commun Grp, Beijing 100080, Peoples R China
关键词
disparity estimation (DE); epipolar geometry; fast motion estimation (ME); H.264/AVC; multiview image; multiview image compression; multiview video; multiview video compression; video coding;
D O I
10.1109/TCSVT.2007.896659
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Effectively coding multiview visual content is an indispensable research topic because multiview image and video that provide greatly enhanced viewing experiences often contain huge amounts of data. Generally, conventional hybrid predictive-coding methodologies are adopted to address the compression by exploiting the temporal and interviewpoint redundancy existing in a multiview image or video sequences. However, their key yet time-consuming component, motion estimation (ME), is usually not efficient in interviewpoint prediction or disparity estimation (DE), because interviewpoint disparity is completely different from temporal motion existing in the conventional video. Targeting a generic fast DE framework for interviewpoint prediction, we propose a novel DE technique in this paper to accelerate the disparity search by employing epipolar geometry. Theoretical analysis, optimal disparity vector distribution histograms, and experimental results show that the proposed epipolar geometry-based DE can greatly reduce search region and effectively track large and irregular disparity, which is typical in convergent multiview camera setups. Compared with the existing state-of-the-art fast ME approaches, our proposed DE can obtain a similar coding efficiency while achieving a significant speedup for interviewpoint prediction and coding. Moreover, a robustness study shows that the proposed DE algorithm is insensitive to the epipolar geometry estimation noise. Hence, its wide application for multiview image and video coding is promising.
引用
收藏
页码:737 / 750
页数:14
相关论文
共 27 条
[1]  
[Anonymous], P SIGGRAPH
[2]  
[Anonymous], 2005, JTC1SC29WG11 ISOIEC
[3]   SAD reuse in hierarchical motion estimation for the H.264 encoder [J].
Ates, HF ;
Altunbasak, Y .
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, :905-908
[4]  
AYDINOGLU H, 1994, IEEE IMAGE PROC, P385, DOI 10.1109/ICIP.1994.413597
[5]  
Chen SX, 2002, NEW CARBON MATER, V17, P6
[6]   A complexity-bounded motion estimation algorithm [J].
Chimienti, A ;
Ferraris, C ;
Pau, D .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2002, 11 (04) :387-392
[7]  
Hartley R., 2000, MULTIPLE VIEW GEOMET
[8]   In defense of the eight-point algorithm [J].
Hartley, RI .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (06) :580-593
[9]  
Hata K., 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348), P472, DOI 10.1109/ICIP.1999.822941
[10]  
*ISO IEC, 2005, JTC1SC29WG11