Automatic Depth Extraction from 2D Images Using a Cluster-Based Learning Framework

被引:28
作者
Herrera, Jose L. [1 ]
del-Blanco, Carlos R. [1 ]
Garcia, Narciso [1 ]
机构
[1] Univ Politecn Madrid, Grp Tratamiento Imagenes, E-28040 Madrid, Spain
关键词
Depth extraction; 2D-to-3D conversion; machine learning; depth maps; clustering; SCENE; SHAPE;
D O I
10.1109/TIP.2018.2813093
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There has been a significant increase in the availability of 3D players and displays in the last years. Nonetheless, the amount of 3D content has not experimented an increment of such magnitude. To alleviate this problem, many algorithms for converting images and videos from 2D to 3D have been proposed. Here, we present an automatic learning-based 2D-3D image conversion approach, based on the key hypothesis that color images with similar structure likely present a similar depth structure. The presented algorithm estimates the depth of a color query image using the prior knowledge provided by a repository of color + depth images. The algorithm clusters this database attending to their structural similarity, and then creates a representative of each color-depth image cluster that will be used as prior depth map. The selection of the appropriate prior depth map corresponding to one given color query image is accomplished by comparing the structural similarity in the color domain between the query image and the database. The comparison is based on a K-Nearest Neighbor framework that uses a learning procedure to build an adaptive combination of image feature descriptors. The best correspondences determine the cluster, and in turn the associated prior depth map. Finally, this prior estimation is enhanced through a segmentation-guided filtering that obtains the final depth map estimation. This approach has been tested using two publicly available databases, and compared with several state-of-the-art algorithms in order to prove its efficiency.
引用
收藏
页码:3288 / 3299
页数:12
相关论文
共 30 条
[1]   A 2D to 3D video and image conversion technique based on a bilateral filter [J].
Angot, Ludovic J. ;
Huang, Wei-Jia ;
Liu, Kai-Che .
THREE-DIMENSIONAL IMAGE PROCESSING (3DIP) AND APPLICATIONS, 2010, 7526
[2]  
[Anonymous], IEEE 3DTV C 2014 JUL
[3]   Contour Detection and Hierarchical Image Segmentation [J].
Arbelaez, Pablo ;
Maire, Michael ;
Fowlkes, Charless ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :898-916
[4]   A Novel 2D-to-3D Conversion System Using Edge Information [J].
Cheng, Chao-Chung ;
Li, Chung-Te ;
Chen, Liang-Gee .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (03) :1739-1745
[5]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[6]   A NEW MODE SELECTION TECHNIQUE FOR CODING DEPTH MAPS OF 3D VIDEO [J].
De Silva, D. V. S. X. ;
Fernando, W. A. C. ;
Arachchi, H. Kodikara .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :686-689
[7]   Semi-automatic Stereo Extraction from Video Footage [J].
Guttmann, Moshe ;
Wolf, Lior ;
Cohen-Or, Daniel .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :136-142
[8]  
Herrera JL, 2014, IEEE IMAGE PROC, P2022, DOI 10.1109/ICIP.2014.7025405
[9]  
Hoiem D, 2005, IEEE I CONF COMP VIS, P654
[10]   Regions of interest extraction from color image based on visual saliency [J].
Huang, Chaobing ;
Liu, Quan ;
Yu, Shengsheng .
JOURNAL OF SUPERCOMPUTING, 2011, 58 (01) :20-33