Two-shot Video Object Segmentation

被引:12
|
作者
Yan, Kun [1 ]
Li, Xiao [2 ]
Wei, Fangyun [2 ]
Wang, Jinglu [2 ]
Zhang, Chenbin [1 ]
Wang, Ping [1 ]
Lu, Yan [2 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.00224
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous works on video object segmentation (VOS) are trained on densely annotated videos. Nevertheless, acquiring annotations in pixel level is expensive and time-consuming. In this work, we demonstrate the feasibility of training a satisfactory VOS model on sparsely annotated videos-we merely require two labeled frames per training video while the performance is sustained. We term this novel training paradigm as two-shot video object segmentation, or two-shot VOS for short. The underlying idea is to generate pseudo labels for unlabeled frames during training and to optimize the model on the combination of labeled and pseudo-labeled data. Our approach is extremely simple and can be applied to a majority of existing frameworks. We first pre-train a VOS model on sparsely annotated videos in a semi-supervised manner, with the first frame always being a labeled one. Then, we adopt the pre-trained VOS model to generate pseudo labels for all unlabeled frames, which are subsequently stored in a pseudo-label bank. Finally, we retrain a VOS model on both labeled and pseudo-labeled data without any restrictions on the first frame. For the first time, we present a general way to train VOS models on two-shot VOS datasets. By using 7.3% and 2.9% labeled data of YouTube-VOS and DAVIS benchmarks, our approach achieves comparable results in contrast to the counterparts trained on fully labeled set. Code and models are available at https://github.com/ykpku/Two-shot-Video-Object-Segmentation.
引用
收藏
页码:2257 / 2267
页数:11
相关论文
共 50 条
  • [21] A shot boundary detection method for news video based on object segmentation and tracking
    Xu, Xin-Wen
    Li, Guo-Hui
    Yuan, Jian
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2470 - 2475
  • [22] Hierarchical Graph Pattern Understanding for Zero-Shot Video Object Segmentation
    Pei, Gensheng
    Shen, Fumin
    Yao, Yazhou
    Chen, Tao
    Hua, Xian-Sheng
    Shen, Heng-Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5909 - 5920
  • [23] Motion-Attentive Transition for Zero-Shot Video Object Segmentation
    Zhou, Tianfei
    Wang, Shunzhou
    Zhou, Yi
    Yao, Yazhou
    Li, Jianwu
    Shao, Ling
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13066 - 13073
  • [24] Bayesian video shot segmentation
    Vasconcelos, N
    Lippman, A
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 1009 - 1015
  • [25] Video shot segmentation and classification
    Gong, YH
    Liu, X
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 860 - 863
  • [26] REGION ADAPTIVE TWO-SHOT NETWORK FOR SINGLE IMAGE DEHAZING
    Li, Hui
    Wu, Qingbo
    Ngan, King Ngi
    Li, Hongliang
    Meng, Fanman
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [27] Invisible PAB Door Development Using Two-Shot Molding
    Byung Seok Kong
    Dong Kyou Park
    International Journal of Automotive Technology, 2019, 20 : 221 - 225
  • [28] Spurious signals in DQF spectroscopy: two-shot stimulated echoes
    Pictet, J
    van der Klink, JJ
    Meuli, R
    MAGNETIC RESONANCE MATERIALS IN PHYSICS BIOLOGY AND MEDICINE, 2004, 17 (02) : 74 - 79
  • [29] Spurious signals in DQF spectroscopy: two-shot stimulated echoes
    Jacqueline Pictet
    J. J. van der Klink
    R. Meuli
    Magnetic Resonance Materials in Physics, Biology and Medicine, 2004, 17 : 74 - 79
  • [30] Invisible PAB Door Development Using Two-Shot Molding
    Kong, Byung Seok
    Park, Dong Kyou
    INTERNATIONAL JOURNAL OF AUTOMOTIVE TECHNOLOGY, 2019, 20 (02) : 221 - 225