Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation

被引:1
|
作者
Wang, Qiuyue [1 ]
Zhang, Songyang [1 ]
He, Xuming [1 ]
机构
[1] ShanghaiTech Univ, Shanghai, Peoples R China
关键词
Few-shot Video Instance Segmentation; Few-shot Object Detection; Few-shot Learning;
D O I
10.1109/ICIP46576.2022.9897620
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional video instance segmentation (VIS) aims to detect, segment, and track object instances from a known class set in videos. In real-world applications, however, video instance segmentation typically need to cope with novel-class instances and to fast adapt with a few labeled videos. In this work, we aim to tackle the task of few-shot video instance segmentation (FVIS), which is challenging due to large variations in object appearance and motion. We propose a robust temporally coherent strategy, termed as VTFA, based on a two-stage fine-tuning approach. VTFA enforces the instance segmentation of novel classes to be temporally smooth and reduces the classification bias between novel and base classes. The proposed Memory-aware Temporal Context Encoding Module (MTCE) in VTFA encodes the temporal context information, which contributes to the consistency in the final predictions. We also propose a loss named Instance-level Pair-wise Contrastive (IPC) Loss on both the novel and base classes to enhance the robustness of instance classification. To validate our method, we develop a YouTube-VIS-FS benchmark to compare our method with several baselines. The experimental evaluation shows that our strategy is superior or competitive to those strong baselines.
引用
收藏
页码:251 / 255
页数:5
相关论文
共 50 条
  • [1] Incremental Few-Shot Instance Segmentation
    Ganea, Dan Andrei
    Boom, Bas
    Poppe, Ronald
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1185 - 1194
  • [2] Assessing the Generalizability of Temporally-Coherent Echocardiography Video Segmentation
    Chen, Yida
    Zhang, Xiaoyan
    Haggerty, Christopher M.
    Stough, Joshua, V
    MEDICAL IMAGING 2021: IMAGE PROCESSING, 2021, 11596
  • [3] Dynamic Transformer for Few-shot Instance Segmentation
    Wang, Haochen
    Liu, Jie
    Liu, Yongtuo
    Maji, Subhransu
    Sonke, Jan-Jakob
    Gavves, Efstratios
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2969 - 2977
  • [4] Sandwich Cut: an Algorithm for Temporally-coherent Video Bilayer Segmentation
    Pu, Songtao
    Zha, Hongbin
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1061 - 1066
  • [5] Towards Temporally-Coherent Video Matting
    Bai, Xue
    Wang, Jue
    Simons, David
    COMPUTER VISION/COMPUTER GRAPHICS COLLABORATION TECHNIQUES, MIRAGE 2011, 2011, 6930 : 63 - 74
  • [6] TPN: Triple parts network for few-shot instance segmentation
    Wang, Haotian
    Zhou, Shibin
    Xu, Xinzheng
    Zhang, Guopeng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (30) : 46439 - 46455
  • [7] FGN: Fully Guided Network for Few-Shot Instance Segmentation
    Fan, Zhibo
    Yu, Jin-Gang
    Liang, Zhihao
    Ou, Jiarong
    Gao, Changxin
    Xia, Gui-Song
    Li, Yuanqing
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9169 - 9178
  • [8] CellTranspose: Few-shot Domain Adaptation for Cellular Instance Segmentation
    Keaton, Matthew R.
    Zaveri, Ram J.
    Doretto, Gianfranco
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 455 - 466
  • [9] TPN: Triple parts network for few-shot instance segmentation
    Haotian Wang
    Shibin Zhou
    Xinzheng Xu
    Guopeng Zhang
    Multimedia Tools and Applications, 2023, 82 : 46439 - 46455
  • [10] Few-shot video object segmentation with prototype evolution
    Mao, Binjie
    Liu, Xiyan
    Shi, Linsu
    Yu, Jiazhong
    Li, Fei
    Xiang, Shiming
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (10): : 5367 - 5382