Principled fusion of high-level model and low-level cues for motion segmentation

Citations: 0
Authors
Thayananthan, Arasanathan [1]
Iwasaki, Masahiro [2]
Cipolla, Roberto [1]
Affiliations
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
[2] Panasonic Europe Ltd, Cambridge CB3 0AX, England
Source
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12 | 2008
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
High-level generative models provide elegant descriptions of videos and are commonly used as the inference framework in many unsupervised motion segmentation schemes. However, approximate inference in these models often requires ad hoc initialization to avoid local minima. Low-level cues, obtained independently of the high-level model, can constrain the search space and reduce the chance of inference algorithms falling into a local minimum. This paper introduces a novel principled fusion framework in which local hierarchical superpixel segmentations of images are used to capture local motion. Low-level cues such as local motion are not, on their own, adequate to obtain a full motion segmentation, as occlusion needs to be handled globally. We fuse the low-level motion cues with the high-level model in a principled manner to overcome the shortcomings of using only the high-level model or only low-level cues to perform motion segmentation. The fused model contains both continuous and discrete variables, which form a number of Markov random fields. Variational approximation or belief propagation algorithms cannot be applied because of the complex interactions between the variables. Hence, approximate inference is performed using the expectation propagation (EP) algorithm. The scheme is demonstrated by performing motion segmentation on two video sequences.
Pages: 751 / +
Page count: 3