Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks

被引：13

作者：

Liu, Ziyi ^{[1
]}

Wang, Le ^{[1
]}

Hua, Gang ^{[2
]}

Zhang, Qilin ^{[3
]}

Niu, Zhenxing ^{[4
]}

Wu, Ying ^{[5
]}

Zheng, Nanning ^{[1
]}

机构：

[1] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Shaanxi, Peoples R China

[2] Microsoft Res, Redmond, WA 98052 USA

[3] HERE Technol, Chicago, IL 60606 USA

[4] Alibaba Grp, Hangzhou 311121, Zhejiang, Peoples R China

[5] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2018年 / 27卷 / 12期

基金：

中国博士后科学基金; 中国国家自然科学基金; 美国国家科学基金会;

关键词：

Object segmentation; object discovery; dynamic Markov networks; probabilistic graphical model; CO-SEGMENTATION; OPTICAL-FLOW; RECOGNITION; HISTOGRAMS;

D O I：

10.1109/TIP.2018.2859622

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is a challenging task to extract segmentation mask of a target from a single noisy video, which involves object discovery coupled with segmentation. To solve this challenge, we present a method to jointly discover and segment an object from a noisy video, where the target disappears intermittently throughout the video. Previous methods either only fulfill video object discovery, or video object segmentation presuming the existence of the object in each frame. We argue that jointly conducting the two tasks in a unified way will be beneficial. In other words, video object discovery and video object segmentation tasks can facilitate each other. To validate this hypothesis, we propose a principled probabilistic model, where two dynamic Markov networks are coupled-one for discovery and the other for segmentation. When conducting the Bayesian inference on this model using belief propagation, the bi-directional message passing reveals a clear collaboration between these two inference tasks. We validated our proposed method in five data sets. The first three video data sets, i.e., the SegTrack data set, the YouTube-objects data set, and the Davis data set, are not noisy, where all video frames contain the objects. The two noisy data sets, i.e., the XJTU-Stevens data set, and the Noisy-ViDiSeg data set, newly introduced in this paper, both have many frames that do not contain the objects. When compared with state of the art, it is shown that although our method produces inferior results on video data sets without noisy frames, we are able to obtain better results on video data sets with noisy frames.

引用

页码：5840 / 5853

页数：14

共 50 条

[1] Video Object Discovery and Co-Segmentation with Extremely Weak Supervision
Wang, Le
Hua, Gang
Sukthankar, Rahul
Xue, Jianru
Niu, Zhenxing
Zheng, Nanning
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (10) : 2074 - 2088
[2] Adversarial Attacks on Video Object Segmentation With Hard Region Discovery
Li, Ping
Zhang, Yu
Yuan, Li
Zhao, Jian
Xu, Xianghua
Zhang, Xiaoqin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (06) : 5049 - 5062
[3] JOINT OBJECT DISCOVERY AND SEGMENTATION WITH IMAGE-WISE RECONSTRUCTION ERROR
Tarashima, Shuhei
Pan, Jingjing
Irie, Go
Kurozumi, Takayuki
Kinebuchi, Tetsuya
2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 849 - 853
[4] Video Object Segmentation using Optical Flow and Recurrent Neural Networks
Kalezic, Mirko
Sekulic, Petar
Kovacevic, Slavko
2020 9TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2020, : 455 - 458
[5] Joint Stereo Video Deblurring, Scene Flow Estimation and Moving Object Segmentation
Pan, Liyuan
Dai, Yuchao
Liu, Miaomiao
Porikli, Fatih
Pan, Quan
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (1748-1761) : 1748 - 1761
[6] Joint Multisource Saliency and Exemplar Mechanism for Weakly Supervised Video Object Segmentation
En, Qing
Duan, Lijuan
Zhang, Zhaoxiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 8155 - 8169
[7] Guided Co-Segmentation Network for Fast Video Object Segmentation
Liu, Weide
Lin, Guosheng
Zhang, Tianyi
Liu, Zichuan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (04) : 1607 - 1617
[8] Unsupervised regions based segmentation using object discovery
Yang, Bai
Yu, Huimin
Hu, Roland
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2015, 31 : 125 - 137
[9] Unsupervised Object Discovery and Segmentation in Videos
Schulter, Samuel
Leistner, Christian
Roth, Peter M.
Bischof, Horst
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[10] Video object segmentation using SVMs
Zhao, Y
Li, HL
Ahalt, SC
7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTER SCIENCE AND ENGINEERING, 2003, : 333 - 337

← 1 2 3 4 5 →