Rational filters for passive depth from defocus

被引:161
作者
Watanabe, M
Nayar, SK
机构
[1] Hitachi Ltd, Prod Engn Res Lab, Yokohama, Kanagawa 244, Japan
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
passive depth from defocus; blur function; scene textures; normalized image ratio; broadband rational operators; texture invariance; depth confidence measure; depth estimation; real-time performance;
D O I
10.1023/A:1007905828438
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fundamental problem in depth from defocus is the measurement of relative defocus between images. The performance of previously proposed focus operators are inevitably sensitive to the frequency spectra of local scene textures. As a result, focus operators such as the Laplacian of Gaussian result in poor depth estimates. An alternative is to use large filter banks that densely sample the frequency space. Though this approach can result in better depth accuracy, it sacrifices the computational efficiency that depth from defocus offers over stereo and structure from motion. We propose a class of broadband operators that, when used together, provide invariance to scene texture and produce accurate and dense depth maps. Since the operators are broadband, a small number of them are sufficient for depth estimation of scenes with complex textural properties. In addition, a depth confidence measure is derived that can be computed from the outputs of the operators. This confidence measure permits further refinement of computed depth maps. Experiments are conducted on both synthetic and real scenes to evaluate the performance of the proposed operators. The depth detection gain error is less than 1%, irrespective of texture frequency. Depth accuracy is found to be 0.5 similar to 1.2% of the distance of the object from the imaging optics.
引用
收藏
页码:203 / 225
页数:23
相关论文
共 31 条
[1]  
[Anonymous], 1994, PYRAMID FRAMEWORK EA
[2]  
Besl P. J., 1988, GMR6090 GEN MOT RES, VGMR-6090
[3]  
BORN M, 1965, PRINCIPLES OPTICS
[4]  
BOVE VM, 1993, J OPT SOC AM A, V10, P561, DOI 10.1364/JOSAA.10.000561
[5]  
Bracewell R.N., 1965, The Fourier Transform and Its Applications
[6]   THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE [J].
BURT, PJ ;
ADELSON, EH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) :532-540
[7]  
Darrell T., 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.88CH2605-4), P504, DOI 10.1109/CVPR.1988.196282
[8]  
Ens J., 1991, Proceedings 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (91CH2983-5), P600, DOI 10.1109/CVPR.1991.139760
[9]  
GOKSTORP M, 1994, P INT C PATT REC
[10]  
Hoel P., 1971, INTRO MATH STAT