Video Object Segmentation without Temporal Information

被引:212
|
作者
Maninis, Kevis-Kokitsi [1 ]
Caelles, Sergi [1 ]
Chen, Yuhua [1 ]
Pont-Tuset, Jordi [1 ]
Leal-Taixe, Laura [2 ]
Cremers, Daniel [2 ]
Van Gool, Luc [1 ]
机构
[1] ETHZ, CH-8092 Zurich, Switzerland
[2] TUM, D-80333 Munich, Germany
基金
欧盟地平线“2020”;
关键词
Video object segmentation; convolutional neural networks; semantic segmentation; instance segmentation;
D O I
10.1109/TPAMI.2018.2838670
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames. When the temporal smoothness is suddenly broken, such as when an object is occluded, or some frames are missing in a sequence, the result of these methods can deteriorate significantly. This paper explores the orthogonal approach of processing each frame independently, i.e., disregarding the temporal information. In particular, it tackles the task of semi-supervised video object segmentation: the separation of an object from the background in a video, given its mask in the first frame. We present Semantic One-Shot Video Object Segmentation (OSVOSS), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground segmentation, and finally to learning the appearance of a single annotated object of the test sequence (hence one shot). We show that instance-level semantic information, when combined effectively, can dramatically improve the results of our previous method, OSVOS. We perform experiments on two recent single-object video segmentation databases, which show that OSVOSS is both the fastest and most accurate method in the state of the art. Experiments on multi-object video segmentation show that OSVOSS obtains competitive results.
引用
收藏
页码:1515 / 1530
页数:16
相关论文
共 50 条
  • [31] Temporal Transductive Inference for Few-Shot Video Object Segmentation
    Siam, Mennatullah
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025,
  • [32] Video object segmentation using spatio-temporal deep network
    Ramaswamy, Akshaya
    Gubbi, Jayavardhana
    Balamuralidhar, P.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [33] Temporal video segmentation using unsupervised clustering and semantic object tracking
    Günsel, B
    Ferman, AM
    Tekalp, AM
    JOURNAL OF ELECTRONIC IMAGING, 1998, 7 (03) : 592 - 604
  • [34] G-TRACE: Grouped temporal recalibration for video object segmentation
    Kim, Jiyun
    Kim, Jooho
    Hong, Sungeun
    IMAGE AND VISION COMPUTING, 2024, 147
  • [35] Automatic moving object segmentation based on spatial and temporal information
    Ho, WJ
    Wang, TC
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS IV, 1999, 3846 : 204 - 212
  • [36] Breaking the "Object" in Video Object Segmentation
    Tokmakov, Pavel
    Li, Jie
    Gaidon, Adrien
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22836 - 22845
  • [37] Video Object Extraction Integrating Temporal-Spatial Information
    Zhu, Shiping
    Gao, Jie
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC & MECHANICAL ENGINEERING AND INFORMATION TECHNOLOGY (EMEIT-2012), 2012, 23
  • [38] A video segmentation algorithm based on spatial-temporal information
    Zhu, H
    Li, ZM
    2002 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS AND WEST SINO EXPOSITION PROCEEDINGS, VOLS 1-4, 2002, : 566 - 569
  • [39] Object based segmentation of video using color, motion and spatial information
    Khan, S
    Shah, M
    2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2001, : 746 - 751
  • [40] Video-Based Sign Language Recognition without Temporal Segmentation
    Huang, Jie
    Zhou, Wengang
    Zhang, Qilin
    Li, Houqiang
    Li, Weiping
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 2257 - 2264