A Novel 2D to 3D Video Conversion System Based on a Machine Learning Approach

Cited by: 9
Authors
Herrera, Jose L. [1]
del-Blanco, Carlos R. [1]
Garcia, Narciso [1]
Affiliations
[1] Univ Politecn Madrid, Grp Tratamiento Imagenes, E-28040 Madrid, Spain
Keywords
depth extraction; 2D-to-3D conversion; depth maps; machine learning; clustering; DEPTH;
DOI
10.1109/TCE.2016.7838096
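Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract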
There has recently been a significant increase in the number of available 3D displays and players. Nevertheless, the amount of 3D content has not increased to the same extent, creating a gap between 3D supply and demand. To reduce this gap, many algorithms have been proposed for 2D-to-3D image and video conversion. These algorithms usually require several images of the same scene to perform the conversion. In this paper, an automatic algorithm for estimating the 3D structure of a scene from a single color image is proposed. It is based on the key assumption that color images with similar structure are likely to have similar depth structures. The conversion algorithm is split into an offline and an online module so that it can be easily deployed on consumer devices, such as smartphones or TVs. The offline module pre-processes a color+depth image database to speed up the subsequent depth estimation. The online module infers a depth prior from a color query image using that database as training data. This prior is then refined through segmentation-guided filtering. The conversion algorithm has been evaluated on three publicly available databases and compared with several state-of-the-art algorithms to demonstrate its efficiency.
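The offline/online split described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the image descriptor, the similarity measure, the way neighboring depth maps are fused, or the exact filter, so the grid-mean color descriptor, Euclidean distance, per-pixel median fusion, per-segment averaging, and all function names below are assumptions chosen only to make the pipeline concrete.

```python
import numpy as np


def descriptor(image, grid=4):
    """Hypothetical global descriptor: mean color over a grid x grid layout."""
    h, w, _ = image.shape
    cells = [
        image[i * h // grid:(i + 1) * h // grid,
              j * w // grid:(j + 1) * w // grid].mean(axis=(0, 1))
        for i in range(grid) for j in range(grid)
    ]
    return np.concatenate(cells)


def build_index(color_images):
    """Offline step (assumed form): precompute one descriptor per database image."""
    return np.stack([descriptor(img) for img in color_images])


def depth_prior(query, index, depth_maps, k=3):
    """Online step (assumed form): fuse the depth maps of the k most similar images."""
    dists = np.linalg.norm(index - descriptor(query), axis=1)
    nearest = np.argsort(dists)[:k]
    # Per-pixel median is an assumed fusion rule, not taken from the paper.
    return np.median(np.stack([depth_maps[i] for i in nearest]), axis=0)


def refine(prior, labels):
    """Assumed segmentation-guided smoothing: per-segment mean of the depth prior."""
    refined = np.empty_like(prior, dtype=float)
    for lab in np.unique(labels):
        mask = labels == lab
        refined[mask] = prior[mask].mean()
    return refined
```

In this sketch, `build_index` would run once over the color+depth database, while `depth_prior` and `refine` would run per query image; `labels` stands for an integer segment map of the query produced by any segmentation method.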
Pages: 429-436
Number of pages: 8