Self Supervised Progressive Network for High Performance Video Object Segmentation

被引:4
|
作者
Li, Guorong [1 ]
Hong, Dexiang [1 ]
Xu, Kai [1 ]
Zhong, Bineng [2 ]
Su, Li [1 ]
Han, Zhenjun [3 ]
Huang, Qingming [1 ]
机构
[1] Univ ChineseAcademy Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
[3] Univ Chinese Acad Sci UCAS, Sch Elect Elect & Commun Engn, Beijing 101408, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Customer relationship management; Semantics; Object segmentation; Collaboration; Visualization; Decoding; Cycle consistency; self-supervised; similarity learning; video object segmentation (VOS); TRACKING;
D O I
10.1109/TNNLS.2022.3219936
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, self-supervised video object segmentation (VOS) has attracted much interest. However, most proxy tasks are proposed to train only a single backbone, which relies on a point-to-point correspondence strategy to propagate masks through a video sequence. Due to its simple pipeline, the performance of the single backbone paradigm is still unsatisfactory. Instead of following the previous literature, we propose our self-supervised progressive network (SSPNet) which consists of a memory retrieval module (MRM) and collaborative refinement module (CRM). The MRM can perform point-to-point correspondence and produce a propagated coarse mask for a query frame through self-supervised pixel-level and frame-level similarity learning. The CRM, which is trained via cycle consistency region tracking, aggregates the reference & query information and learns the collaborative relationship among them implicitly to refine the coarse mask. Furthermore, to learn semantic knowledge from unlabeled data, we also design two novel mask-generation strategies to provide the training data with meaningful semantic information for the CRM. Extensive experiments conducted on DAVIS-17, YouTube-VOS and SegTrack v2 demonstrate that our method surpasses the state-of-the-art self-supervised methods and narrows the gap with the fully supervised methods.
引用
收藏
页码:7671 / 7684
页数:14
相关论文
共 50 条
  • [41] Adaptive convolutional neural network for large change in video object segmentation
    Yin, Hui
    Yang, Lin
    Xu, Hongli
    Wan, Jin
    IET COMPUTER VISION, 2019, 13 (05) : 452 - 460
  • [42] Hierarchical Video Object Segmentation
    Xing, Junliang
    Ai, Haizhou
    Lao, Shihong
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 67 - 71
  • [43] A neural network approach to Bayesian background modeling for video object segmentation
    Culibrk, Dubravko
    Marques, Oge
    Socek, Daniel
    Kalva, Hari
    Furht, Borko
    VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2006, : 474 - +
  • [44] Automatic cardiac evaluations using a deep video object segmentation network
    Sirjani, Nasim
    Moradi, Shakiba
    Oghli, Mostafa Ghelich
    Hosseinsabet, Ali
    Alizadehasl, Azin
    Yadollahi, Mona
    Shiri, Isaac
    Shabanzadeh, Ali
    INSIGHTS INTO IMAGING, 2022, 13 (01)
  • [45] Boosting Video Object Segmentation via Robust and Efficient Memory Network
    Chen, Yadang
    Zhang, Dingwei
    Zheng, Yuhui
    Yang, Zhi-Xin
    Wu, Enhua
    Zhao, Haixing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3340 - 3352
  • [46] A semi-supervised recurrent neural network for video salient object detection
    Kompella, Aditya
    Kulkarni, Raghavendra, V
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (06): : 2065 - 2083
  • [47] Weakly Supervised Instance Segmentation by Exploring Entire Object Regions
    Zhang, Ke
    Yuan, Chun
    Zhu, Yiming
    Jiang, Yong
    Luo, Lishu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 352 - 363
  • [48] Scale-Aware Feature Network for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    IEEE ACCESS, 2020, 8 : 75957 - 75967
  • [49] Online Meta Adaptation for Fast Video Object Segmentation
    Xiao, Huaxin
    Kang, Bingyi
    Liu, Yu
    Zhang, Maojun
    Feng, Jiashi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (05) : 1205 - 1217
  • [50] Adaptive Selection of Reference Frames for Video Object Segmentation
    Hong, Lingyi
    Zhang, Wei
    Chen, Liangyu
    Zhang, Wenqiang
    Fan, Jianping
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1057 - 1071