Two-shot Video Object Segmentation

被引:12
|
作者
Yan, Kun [1 ]
Li, Xiao [2 ]
Wei, Fangyun [2 ]
Wang, Jinglu [2 ]
Zhang, Chenbin [1 ]
Wang, Ping [1 ]
Lu, Yan [2 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.00224
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous works on video object segmentation (VOS) are trained on densely annotated videos. Nevertheless, acquiring annotations in pixel level is expensive and time-consuming. In this work, we demonstrate the feasibility of training a satisfactory VOS model on sparsely annotated videos-we merely require two labeled frames per training video while the performance is sustained. We term this novel training paradigm as two-shot video object segmentation, or two-shot VOS for short. The underlying idea is to generate pseudo labels for unlabeled frames during training and to optimize the model on the combination of labeled and pseudo-labeled data. Our approach is extremely simple and can be applied to a majority of existing frameworks. We first pre-train a VOS model on sparsely annotated videos in a semi-supervised manner, with the first frame always being a labeled one. Then, we adopt the pre-trained VOS model to generate pseudo labels for all unlabeled frames, which are subsequently stored in a pseudo-label bank. Finally, we retrain a VOS model on both labeled and pseudo-labeled data without any restrictions on the first frame. For the first time, we present a general way to train VOS models on two-shot VOS datasets. By using 7.3% and 2.9% labeled data of YouTube-VOS and DAVIS benchmarks, our approach achieves comparable results in contrast to the counterparts trained on fully labeled set. Code and models are available at https://github.com/ykpku/Two-shot-Video-Object-Segmentation.
引用
收藏
页码:2257 / 2267
页数:11
相关论文
共 50 条
  • [1] IS TWO-SHOT ALL YOU NEED? A LABEL-EFFICIENT APPROACH FOR VIDEO SEGMENTATION IN BREAST ULTRASOUND
    Zeng, Jiajun
    Ni, Dong
    Huang, Ruobing
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [2] One-Shot Video Object Segmentation
    Caelles, S.
    Maninis, K. -K.
    Pont-Tuset, J.
    Leal-Taixe, L.
    Cremers, D.
    Van Gool, L.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5320 - 5329
  • [3] Two-shot measurement of spatial coherence
    Bhattacharjee, Abhinandan
    Aarav, Shaurya
    Jha, Anand K.
    APPLIED PHYSICS LETTERS, 2018, 113 (05)
  • [4] Few-shot video object segmentation with prototype evolution
    Mao, Binjie
    Liu, Xiyan
    Shi, Linsu
    Yu, Jiazhong
    Li, Fei
    Xiang, Shiming
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (10): : 5367 - 5382
  • [5] Few-shot video object segmentation with prototype evolution
    Binjie Mao
    Xiyan Liu
    Linsu Shi
    Jiazhong Yu
    Fei Li
    Shiming Xiang
    Neural Computing and Applications, 2024, 36 : 5367 - 5382
  • [6] Two-Shot SVBRDF Capture for Stationary Materials
    Aittala, Miika
    Weyrich, Tim
    Lehtinen, Jaakko
    ACM TRANSACTIONS ON GRAPHICS, 2015, 34 (04):
  • [7] Two-shot insert molding press debuts
    Anon
    Modern Plastics, 2001, 78 (10):
  • [8] INTERFACE CONDITIONS OF TWO-SHOT MOLDED PARTS
    Kisslinger, Thomas
    Bruckmoser, Katharina
    Lucyshyn, Thomas
    Langecker, Guenter Ruediger
    Resch, Katharina
    Holzer, Clemens
    PROCEEDINGS OF PPS-29: THE 29TH INTERNATIONAL CONFERENCE OF THE POLYMER - CONFERENCE PAPERS, 2014, 1593 : 170 - 174
  • [9] Two-shot cocktail: Adenosine, dopamine and a twist of βγ
    R. Adron Harris
    Richard A. Morrisett
    Nature Medicine, 2002, 8 : 777 - 779
  • [10] Video object segmentation and two kinds of object-oriented video coder
    Weng, Nanshan
    Cai, Dejun
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2000, 28 (10): : 106 - 110