SINGLE IMAGE DEPTH ESTIMATION FROM IMAGE DESCRIPTORS

被引:0
作者
Lin, Yu-Hsun [1 ,3 ]
Cheng, Wen-Huang [3 ]
Miao, Hsin [2 ]
Ku, Tsung-Hao [2 ]
Hsieh, Yung-Huan [3 ]
机构
[1] Natl Taiwan Univ, Grad Inst Networking & Multimedia, Taipei 10764, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10764, Taiwan
[3] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
来源
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012年
关键词
Depth Estimation; Single Image; SVM; Cloud Computing;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
With the rapid emergence of 3D displays, we can enrich the user's viewing experiences by adding depth information to the widely existing 2D contents. However, effectively inferring the associated depth from a single 2D image is still a challenging problem. By taking benefits from the recently appeared image descriptors, we proposed the use of an SVM based framework for addressing the single image depth estimation. One advantage is its direct extension to incorporate the recent researches of large scale classification via SVM to meet the upcoming cloud computing paradigm. Our experimental results showed that the proposed framework outperforms the state-of-the-art approaches in performance, even the ones using more complex graphical models like MRF. Also, we made a brief investigation on the individual effectiveness of a set of commonly used image descriptors and found that spatial descriptors (e. g. texture) would be more effective than frequency ones (e. g. DCT coefficients).
引用
收藏
页码:809 / 812
页数:4
相关论文
共 11 条
[1]  
[Anonymous], 2006, Advances in Neural Information Processing Systems, DOI [10.1109/TPAMI.2015.2505283a, DOI 10.1109/TPAMI.2015.2505283A, 10.1109/TPAMI.2015.2505283]
[2]  
Liu Beyang, CVPR 10, P1253
[3]  
Mendiburu Bernard., 2009, 3D Movie Making: Stereoscopic Digital Cinema From Scrip to Screen
[4]   A performance evaluation of local descriptors [J].
Mikolajczyk, K ;
Schmid, C .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (10) :1615-1630
[5]  
Saxena A., ICCV 07, P1
[6]   Make3D: Learning 3D Scene Structure from a Single Still Image [J].
Saxena, Ashutosh ;
Sun, Min ;
Ng, Andrew Y. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (05) :824-840
[7]   A taxonomy and evaluation of dense two-frame stereo correspondence algorithms [J].
Scharstein, D ;
Szeliski, R .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 47 (1-3) :7-42
[8]   DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo [J].
Tola, Engin ;
Lepetit, Vincent ;
Fua, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (05) :815-830
[9]   Depth estimation from image structure [J].
Torralba, A ;
Oliva, A .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (09) :1226-1238
[10]  
Yu Hsiang-Fu, ACM SIGKDD 10, P833