A 3D-HEVC Fast Mode Decision Algorithm for Real-Time Applications

被引:36
作者
Shen, Liquan [1 ]
An, Ping [1 ]
Zhang, Zhaoyang [1 ]
Hu, Qianqian [1 ]
Chen, Zhengchuan [2 ]
机构
[1] Shanghai Univ, Shanghai 200041, Peoples R China
[2] Tsinghua Univ, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Algorithms; 3D-HEVC; real-time applications; mode decision; computational complexity; motion estimation; RATE-DISTORTION OPTIMIZATION; CU SIZE DECISION; MULTIVIEW VIDEO; MOTION ESTIMATION; FAST DISPARITY; DEPTH; HEVC; TEXTURE; SELECTION; AVC;
D O I
10.1145/2700298
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
3D High Efficiency Video Coding (3D-HEVC) is an extension of the HEVC standard for coding of multiview videos and depth maps. It inherits the same quadtree coding structure as HEVC for both components, which allows recursively splitting into four equal-sized coding units (CU). One of 11 different prediction modes is chosen to code a CU in inter-frames. Similar to the joint model of H.264/AVC, the mode decision process in HM (reference software of HEVC) is performed using all the possible depth levels and prediction modes to find the one with the least rate distortion cost using a Lagrange multiplier. Furthermore, both motion estimation and disparity estimation need to be performed in the encoding process of 3D-HEVC. Those tools achieve high coding efficiency, but lead to a significant computational complexity. In this article, we propose a fast mode decision algorithm for 3D-HEVC. Since multiview videos and their associated depth maps represent the same scene, at the same time instant, their prediction modes are closely linked. Furthermore, the prediction information of a CU at the depth level X is strongly related to that of its parent CU at the depth level X-1 in the quadtree coding structure of HEVC since two corresponding CUs from two neighboring depth levels share similar video characteristics. The proposed algorithm jointly exploits the inter-view coding mode correlation, the inter-component (texture-depth) correlation and the inter-level correlation in the quadtree structure of 3D-HEVC. Experimental results show that our algorithm saves 66% encoder runtime on average with only a 0.2% BD-Rate increase on coded views and 1.3% BD-Rate increase on synthesized views.
引用
收藏
页数:23
相关论文
共 46 条
[11]  
Correa G, 2013, 2013 IEEE EUROCON, P81, DOI 10.1109/EUROCON.2013.6624969
[12]   Motion vector sharing and bitrate allocation for 3D video-plus-depth coding [J].
Daribo, Ismael ;
Tillier, Christophe ;
Pesquet-Popescu, Beatrice .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
[13]   High Efficiency 3D Video Coding Using New Tools Based on View Synthesis [J].
Domanski, Marek ;
Stankiewicz, Olgierd ;
Wegner, Krzysztof ;
Kurc, Maciej ;
Konieczny, Jacek ;
Siast, Jakub ;
Stankowski, Jakub ;
Ratajczak, Robert ;
Grajek, Tomasz .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (09) :3517-3527
[14]  
Grewatsch S, 2004, IEEE IMAGE PROC, P3271
[15]  
Gu ZY, 2013, IEEE INT CONF MULTI
[16]  
Heming Sun, 2012, 2012 IEEE International Conference on Multimedia and Expo (ICME), P1085, DOI 10.1109/ICME.2012.4
[17]   A Hybrid Video Coder Based on Extended Macroblock Sizes, Improved Interpolation, and Flexible Motion Representation [J].
Karczewicz, Marta ;
Chen, Peisong ;
Joshi, Rajan L. ;
Wang, Xianglin ;
Chien, Wei-Jung ;
Panchal, Rahul ;
Reznik, Yuriy ;
Coban, Muhammed ;
Chong, In Suk .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (12) :1698-1708
[18]  
Kim J, 2012, 2012 PICTURE CODING SYMPOSIUM (PCS), P449, DOI 10.1109/PCS.2012.6213251
[19]   Fast disparity and motion estimation for mufti-view video coding [J].
Kim, Yongtae ;
Kim, Jiyoung ;
Sohn, Kwanghoon .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2007, 53 (02) :712-719
[20]   A Fast and Efficient Multi-View Depth Image Coding Method Based on Temporal and Inter-View Correlations of Texture Images [J].
Lee, Jin Young ;
Wey, Ho-Cheon ;
Park, Du-Sik .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (12) :1859-1868