Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks

被引:13
|
作者
Liu, Ziyi [1 ]
Wang, Le [1 ]
Hua, Gang [2 ]
Zhang, Qilin [3 ]
Niu, Zhenxing [4 ]
Wu, Ying [5 ]
Zheng, Nanning [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Shaanxi, Peoples R China
[2] Microsoft Res, Redmond, WA 98052 USA
[3] HERE Technol, Chicago, IL 60606 USA
[4] Alibaba Grp, Hangzhou 311121, Zhejiang, Peoples R China
[5] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
基金
中国博士后科学基金; 中国国家自然科学基金; 美国国家科学基金会;
关键词
Object segmentation; object discovery; dynamic Markov networks; probabilistic graphical model; CO-SEGMENTATION; OPTICAL-FLOW; RECOGNITION; HISTOGRAMS;
D O I
10.1109/TIP.2018.2859622
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is a challenging task to extract segmentation mask of a target from a single noisy video, which involves object discovery coupled with segmentation. To solve this challenge, we present a method to jointly discover and segment an object from a noisy video, where the target disappears intermittently throughout the video. Previous methods either only fulfill video object discovery, or video object segmentation presuming the existence of the object in each frame. We argue that jointly conducting the two tasks in a unified way will be beneficial. In other words, video object discovery and video object segmentation tasks can facilitate each other. To validate this hypothesis, we propose a principled probabilistic model, where two dynamic Markov networks are coupled-one for discovery and the other for segmentation. When conducting the Bayesian inference on this model using belief propagation, the bi-directional message passing reveals a clear collaboration between these two inference tasks. We validated our proposed method in five data sets. The first three video data sets, i.e., the SegTrack data set, the YouTube-objects data set, and the Davis data set, are not noisy, where all video frames contain the objects. The two noisy data sets, i.e., the XJTU-Stevens data set, and the Noisy-ViDiSeg data set, newly introduced in this paper, both have many frames that do not contain the objects. When compared with state of the art, it is shown that although our method produces inferior results on video data sets without noisy frames, we are able to obtain better results on video data sets with noisy frames.
引用
收藏
页码:5840 / 5853
页数:14
相关论文
共 50 条
  • [1] Video Object Discovery and Co-Segmentation with Extremely Weak Supervision
    Wang, Le
    Hua, Gang
    Sukthankar, Rahul
    Xue, Jianru
    Niu, Zhenxing
    Zheng, Nanning
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (10) : 2074 - 2088
  • [2] Adversarial Attacks on Video Object Segmentation With Hard Region Discovery
    Li, Ping
    Zhang, Yu
    Yuan, Li
    Zhao, Jian
    Xu, Xianghua
    Zhang, Xiaoqin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5049 - 5062
  • [3] JOINT OBJECT DISCOVERY AND SEGMENTATION WITH IMAGE-WISE RECONSTRUCTION ERROR
    Tarashima, Shuhei
    Pan, Jingjing
    Irie, Go
    Kurozumi, Takayuki
    Kinebuchi, Tetsuya
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 849 - 853
  • [4] Video Object Segmentation using Optical Flow and Recurrent Neural Networks
    Kalezic, Mirko
    Sekulic, Petar
    Kovacevic, Slavko
    2020 9TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2020, : 455 - 458
  • [5] Joint Stereo Video Deblurring, Scene Flow Estimation and Moving Object Segmentation
    Pan, Liyuan
    Dai, Yuchao
    Liu, Miaomiao
    Porikli, Fatih
    Pan, Quan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (1748-1761) : 1748 - 1761
  • [6] Joint Multisource Saliency and Exemplar Mechanism for Weakly Supervised Video Object Segmentation
    En, Qing
    Duan, Lijuan
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 8155 - 8169
  • [7] Guided Co-Segmentation Network for Fast Video Object Segmentation
    Liu, Weide
    Lin, Guosheng
    Zhang, Tianyi
    Liu, Zichuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1607 - 1617
  • [8] Unsupervised regions based segmentation using object discovery
    Yang, Bai
    Yu, Huimin
    Hu, Roland
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 31 : 125 - 137
  • [9] Unsupervised Object Discovery and Segmentation in Videos
    Schulter, Samuel
    Leistner, Christian
    Roth, Peter M.
    Bischof, Horst
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [10] Video object segmentation using SVMs
    Zhao, Y
    Li, HL
    Ahalt, SC
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2003, : 333 - 337