Unsupervised video object segmentation by spatiotemporal graphical model

被引:0
作者
Lijun Guo
Tingting Cheng
Yuanjie Huang
Jieyu Zhao
Rong Zhang
机构
[1] Ningbo University,College of Information Science and Engineering
来源
Multimedia Tools and Applications | 2017年 / 76卷
关键词
Unsupervised; Spatiotemporal smoothness; Supervoxel; Conditional random field;
D O I
暂无
中图分类号
学科分类号
摘要
We propose a novel spatiotemporal graphical model for unsupervised video object segmentation. The core of our model is a layered-CRF (conditional random field) that contains two layers, i.e., pixel layer and supervoxel layer. First, the heat diffusion based segmentation and salient region detection is integrated to obtain the segmentation results of the first frame. The results are used as input seeds to train dual probabilistic models of each object class. In the spatiotemporal layered-CRF framework we extend binary segmentation to multiple object segmentation. We add intra-frame spatial matching potential and inter-frame temporal supervoxels consistent potential to link the pixel layer and the supervoxel layer. This improves the spatiotemporal smoothing throughout the video sequence in the proposed model. The proposed unsupervised method lightens the burden of labeling training samples and obtains a smooth and accurate object boundary in video segmentation. The experiments on two public datasets demonstrate that our method outperforms several state-of-the-art methods in both single and multiple foreground cases.
引用
收藏
页码:1037 / 1053
页数:16
相关论文
共 29 条
[1]  
Achanta R(2012)SLIC superpixels compared to state-of-the-art superpixel methods IEEE Transactions on Pattern Analysis and Machine Intelligence 34 2274-2282
[2]  
Shaji A(2012)Fully automatic extraction of salient objects from videos in near real time Comput J 55 3-14
[3]  
Smith K(2013)Semi-supervised video segmentation using tree structured graphical models IEEE Transactions on Pattern Analysis and Machine Intelligence 35 2751-2764
[4]  
Lucchi A(2001)Fast approximate energy minimization via graph cuts IEEE Transactions on Pattern Analysis and Machine Intelligence 23 1222-1239
[5]  
Fua P(2009)Salient region detection by modeling distributions of color and orientation IEEE Transactions on Multimedia 11 892-905
[6]  
Susstrunk S(2013)Cluster-based Co-saliency detection IEEE Trans Image Process 22 3766-3778
[7]  
Akamine K(2001)Representing and recognizing the visual appearance of materials using three-dimensional textons Int J Comput Vis 43 29-44
[8]  
Fukuchi K(2004)Distinctive image features from scale-invariant keypoints Int J Comput Vis 60 91-110
[9]  
Kimura A(2012)Motion coherent tracking using multi-label MRF optimization Int J Comput Vis 100 190-202
[10]  
Takagi S(undefined)undefined undefined undefined undefined-undefined