Self Supervised Progressive Network for High Performance Video Object Segmentation

Cited by: 4

Authors:

Li, Guorong [1]

Hong, Dexiang [1]

Xu, Kai [1]

Zhong, Bineng [2]

Su, Li [1]

Han, Zhenjun [3]

Huang, Qingming [1]

Affiliations:

[1] Univ Chinese Academy Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China

[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China

[3] Univ Chinese Acad Sci UCAS, Sch Elect Elect & Commun Engn, Beijing 101408, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024, Vol. 35, No. 6

Funding:

National Natural Science Foundation of China;

Keywords:

Task analysis; Customer relationship management; Semantics; Object segmentation; Collaboration; Visualization; Decoding; Cycle consistency; self-supervised; similarity learning; video object segmentation (VOS); TRACKING;

DOI:

10.1109/TNNLS.2022.3219936

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Recently, self-supervised video object segmentation (VOS) has attracted much interest. However, most proxy tasks are proposed to train only a single backbone, which relies on a point-to-point correspondence strategy to propagate masks through a video sequence. Because of this simple pipeline, the performance of the single-backbone paradigm is still unsatisfactory. Instead of following the previous literature, we propose a self-supervised progressive network (SSPNet), which consists of a memory retrieval module (MRM) and a collaborative refinement module (CRM). The MRM performs point-to-point correspondence and produces a propagated coarse mask for a query frame through self-supervised pixel-level and frame-level similarity learning. The CRM, which is trained via cycle-consistency region tracking, aggregates the reference and query information and implicitly learns the collaborative relationship between them to refine the coarse mask. Furthermore, to learn semantic knowledge from unlabeled data, we also design two novel mask-generation strategies that provide the CRM with training data carrying meaningful semantic information. Extensive experiments conducted on DAVIS-17, YouTube-VOS, and SegTrack v2 demonstrate that our method surpasses the state-of-the-art self-supervised methods and narrows the gap with the fully supervised methods.
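
The point-to-point correspondence strategy mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' SSPNet or MRM implementation; it only shows the generic idea of propagating a mask through a softmax-normalised feature affinity. The function name `propagate_mask`, the `temperature` value, and the flattened `(pixels, channels)` feature layout are all assumptions made for illustration.

```python
import numpy as np

def propagate_mask(ref_feat, ref_mask, qry_feat, temperature=0.07):
    """Hypothetical helper: copy labels from reference pixels to query pixels.

    ref_feat: (N_ref, C) per-pixel features of the reference frame
    ref_mask: (N_ref, K) one-hot or soft labels over K classes
    qry_feat: (N_qry, C) per-pixel features of the query frame
    Returns an (N_qry, K) array of soft labels for the query frame.
    """
    eps = 1e-8  # guards against division by zero for all-zero features
    # L2-normalise so the dot product below is a cosine similarity.
    ref = ref_feat / (np.linalg.norm(ref_feat, axis=1, keepdims=True) + eps)
    qry = qry_feat / (np.linalg.norm(qry_feat, axis=1, keepdims=True) + eps)
    # Pairwise similarity between every query pixel and every reference pixel.
    sim = (qry @ ref.T) / temperature            # shape (N_qry, N_ref)
    # Row-wise softmax, shifted by the row maximum for numerical stability.
    sim = sim - sim.max(axis=1, keepdims=True)
    affinity = np.exp(sim)
    affinity = affinity / affinity.sum(axis=1, keepdims=True)
    # Each query pixel receives an affinity-weighted average of reference labels.
    return affinity @ ref_mask

# Toy usage: two reference pixels with distinct labels, two query pixels.
ref_feat = np.array([[1.0, 0.0], [0.0, 1.0]])
ref_mask = np.array([[1.0, 0.0], [0.0, 1.0]])
qry_feat = np.array([[0.9, 0.1], [0.1, 0.9]])
coarse = propagate_mask(ref_feat, ref_mask, qry_feat)
labels = coarse.argmax(axis=1)  # hard label per query pixel
```

In this toy case each query pixel takes the label of the reference pixel it most resembles. A mask obtained this way is only a coarse estimate, which is the kind of output the abstract says the CRM is meant to refine.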

Pages: 7671-7684

Number of pages: 14
