Unsupervised Video Object Segmentation Based on Mixture Models and Saliency Detection

被引:0
作者
Guofeng Lin
Wentao Fan
机构
[1] Huaqiao University,Department of Computer Science and Technology
来源
Neural Processing Letters | 2020年 / 51卷
关键词
Video object segmentation; Gaussian mixture model; Markov random field; saliency detection;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, we propose an unsupervised video object segmentation approach which is mainly based on a saliency detection method and the Gaussian mixture model with Markov random field. In our approach, the saliency detection method is developed as a preprocessing technique to calculate the probability of each pixel as the target object. In contrast to traditional saliency detection methods which are normally difficult to obtain the object’s precise boundary and are therefore hard to segment consistent objects, the developed saliency detection method can calculate the saliency of each frame in the video sequence and extract the position and region of the target object with more accurate object boundary. The refined extracted object region is then taken as the prior information and incorporated into the Gaussian mixture model with Markov random field to obtain the precise pixel-wise segmentation result of each frame. The effectiveness of the proposed unsupervised video object segmentation approach is validated through experimental results using both the SegTrack and the SegTrack v2 data sets.
引用
收藏
页码:657 / 674
页数:17
相关论文
共 90 条
[21]  
Fan W(2010)Segmenting salient objects from images and videos Comput Vis- ECCV 22 888-281
[22]  
Hu C(2000)Normalized cuts and image segmentation IEEE Trans Pattern Anal Mach Intell 2010 268-33
[23]  
Du J(2010)Multiple hypothesis video segmentation from superpixel flows Comput Vis ECCV 40 20-4024
[24]  
Bouguila N(2018)Saliency-aware video object segmentation IEEE Trans Pattern Anal Mach Intell 47 4014-2432
[25]  
Fan W(2017)Deep multimodal distance metric learning using click constraints for image ranking IEEE Trans Cybern 27 2420-undefined
[26]  
Bouguila N(2019)Spatial pyramid-enhanced NetVLAD with weighted triplet loss for place recognition IEEE Trans Neural Netw Learn Syst undefined undefined-undefined
[27]  
Du J(2018)Local deep-feature alignment for unsupervised dimension reduction IEEE Trans Image Process undefined undefined-undefined
[28]  
Liu X(undefined)undefined undefined undefined undefined-undefined
[29]  
Felzenszwalb PF(undefined)undefined undefined undefined undefined-undefined
[30]  
Huttenlocher DP(undefined)undefined undefined undefined undefined-undefined