Correspondence as energy-based segmentation

被引:11
作者
Birchfield, Stanley T. [1 ]
Natarajan, Braga
Tomasi, Carlo
机构
[1] Clemson Univ, Dept Elect & Comp Engn, Clemson, SC 29634 USA
[2] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
基金
美国国家科学基金会;
关键词
stereo; motion; correspondence; segmentation; multiway-cut; graph cuts; affine; energy minimization;
D O I
10.1016/j.imavis.2006.08.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We pose the correspondence problem as one of energy-based segmentation. In this framework, correspondence assigns each pixel in an image to exactly one of several non-overlapping regions, and it also computes a displacement function for each region. The framework is better able to capture the scene geometry than the more direct formulation of matching pixels in two or more images, particularly when the surfaces in the scene are not fronto-parallel. To illustrate the framework, we present a specific correspondence algorithm that minimizes an energy functional by alternating between (1) segmenting the image into a number of non-overlapping regions using the multiway-cut algorithm of Boykov, Veksler, and Zabih; and (2) finding the affine parameters describing the displacement of the pixels in each region. After convergence, a final step escapes local minima due to over-segmentation. The basic algorithm is extended in two ways: using ground control points to detect long, thin regions; and warping segmentation results to efficiently process image sequences. Experiments on real images show the algorithm's ability to find an accurate segmentation and displacement map, as well as discontinuities and creases, on a wide variety of stereo and motion imagery. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:1329 / 1340
页数:12
相关论文
共 33 条
[1]  
[Anonymous], 1991, P INT JOINT C ART IN
[2]  
AYER S, 1995, FIFTH INTERNATIONAL CONFERENCE ON COMPUTER VISION, PROCEEDINGS, P777, DOI 10.1109/ICCV.1995.466859
[3]  
BAKER HH, 1981, P 7 INT JOINT C ART, P631
[4]  
Belhumeur P. N., 1993, [1993] Proceedings Fourth International Conference on Computer Vision, P431, DOI 10.1109/ICCV.1993.378184
[5]   Depth discontinuities by pixel-to-pixel stereo [J].
Birchfield, S ;
Tomasi, C .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1999, 35 (03) :269-293
[6]  
Blake A., 1987, Visual Reconstruction
[7]   A layered stereo matching algorithm using image segmentation and global visibility constraints [J].
Bleyer, M ;
Gelautz, M .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2005, 59 (03) :128-150
[8]   Markov random fields with efficient approximations [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
1998 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1998, :648-655
[9]   Fast approximate energy minimization via graph cuts [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) :1222-1239
[10]  
BOYKOV Y, 1999, P IEEE C COMP VIS PA