Semi-Supervised Video Object Segmentation Based on Local and Global Consistency Learning

被引:0
作者
Liang, Huagang [1 ]
Liu, Lihua [1 ]
Bo, Ying [1 ]
Zuo, Chao [1 ]
机构
[1] Changan Univ, Coll Elect & Control Engn, Xian 710064, Peoples R China
关键词
Deep learning; video object segmentation; conduction model; high-speed monitoring video;
D O I
10.1109/ACCESS.2021.3112014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the variety of video types and different quality on the Internet, it brings more challenges to video processing algorithms such as video object segmentation. Most existing video object segmentation methods rely on modules in other fields as an additional structure of the segmentation model. The combination of modules can improve the accuracy of the model, but it will also reduce the algorithm speed. This paper proposes a semi-supervised video object segmentation method based on local and global consistency learning, which does not rely on additional structures to achieve fast segmentation. First, we extract the embedding features of the image based on GhostNet which is the lightweight network. By using the embedded features of pixels, the graph model is established based on the similarity between pixels. Second, we adopt the local-global consistency learning framework to construct the label conduction model. Third, to optimize the memory occupation and inference speed of the model, we propose a sampling strategy for reference frames by considering local and global information. Finally, we establish a high-speed monitoring video dataset to verify the practical application effect of the method. Our method achieves a result of 69.5% J&F mean with 46 FPS on DAVIS 2017 dataset. At the same time, this paper constructed a high-speed monitoring video dataset. The algorithm obtained 68.2% J&F on this dataset, indicating that the method has good generalization and robust performance in practical applications.
引用
收藏
页码:127293 / 127304
页数:12
相关论文
共 29 条
[1]   Fast Temporal Video Segmentation Based on Krawtchouk-Tchebichef Moments [J].
Abdulhussain, Sadiq H. ;
Al-Haddad, Syed Abdul Rahman ;
Saripan, M. Iqbal ;
Mahmmod, Basheera M. ;
Hussien, Aseel .
IEEE ACCESS, 2020, 8 :72347-72359
[2]  
[Anonymous], 2014, P ADV NEUR INF PROC
[3]  
[Anonymous], 2009, P 26 ANN INT C MACHI, DOI DOI 10.1145/1553374.1553516
[4]   CNN in MRF: Video Object Segmentation via Inference in A CNN-Based Higher-Order Spatio-Temporal MRF [J].
Bao, Linchao ;
Wu, Baoyuan ;
Liu, Wei .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :5977-5986
[5]   One-Shot Video Object Segmentation [J].
Caelles, S. ;
Maninis, K. -K. ;
Pont-Tuset, J. ;
Leal-Taixe, L. ;
Cremers, D. ;
Van Gool, L. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5320-5329
[6]   Blazingly Fast Video Object Segmentation with Pixel-Wise Metric Learning [J].
Chen, Yuhua ;
Pont-Tuset, Jordi ;
Montes, Alberto ;
Van Gool, Luc .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1189-1198
[7]   BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames [J].
Griffin, Brent A. ;
Corso, Jason J. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8906-8915
[8]  
He K., 2017, IEEE INT C COMPUT VI, P2961, DOI [10.1109/iccv.201, DOI 10.1109/ICCV.2017.322]
[9]   VideoMatch: Matching Based Video Object Segmentation [J].
Hu, Yuan-Ting ;
Huang, Jia-Bin ;
Schwing, Alexander G. .
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 :56-73
[10]   Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation [J].
Li, Xiaoxiao ;
Loy, Chen Change .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :93-110