Gamifying Video Object Segmentation

被引：9

作者：

Spampinato, Concetto ^{[1
]}

Palazzo, Simone ^{[1
]}

Giordano, Daniela ^{[1
]}

机构：

[1] Univ Catania, Dept Elect Elect & Comp Engn, Pattern Recognit & Comp Vis Lab PeRCeiVe Lab, I-95124 Catania, Italy

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2017年 / 39卷 / 10期

关键词：

Interactive video annotation; games with a purpose; human in the loop; spatio-temporal superpixel segmentation; EXTRACTION; TRACKING;

D O I：

10.1109/TPAMI.2016.2610973

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video object segmentation can be considered as one of the most challenging computer vision problems. Indeed, so far, no existing solution is able to effectively deal with the peculiarities of real-world videos, especially in cases of articulated motion and object occlusions; limitations that appear more evident when we compare the performance of automated methods with the human one. However, manually segmenting objects in videos is largely impractical as it requires a lot of time and concentration. To address this problem, in this paper we propose an interactive video object segmentation method, which exploits, on one hand, the capability of humans to identify correctly objects in visual scenes, and on the other hand, the collective human brainpower to solve challenging and large-scale tasks. In particular, our method relies on a game with a purpose to collect human inputs on object locations, followed by an accurate segmentation phase achieved by optimizing an energy function encoding spatial and temporal constraints between object regions as well as human-provided location priors. Performance analysis carried out on complex video benchmarks, and exploiting data provided by over 60 users, demonstrated that our method shows a better trade-off between annotation times and segmentation accuracy than interactive video annotation and automated video object segmentation approaches.

引用

页码：1942 / 1958

页数：17

共 54 条

[1]

Achanta R., 2010, EPFL Technical Report 149300, V6, P15

[2]

Addis M., 2010, P 1 INT DIG PRES INT

[3]

[Anonymous], 2008, IEEE Conf. Comput. Vis. Pattern Recognit, DOI DOI 10.1109/CVPR.2008.4587581

[4]

[Anonymous], VISION RES, DOI [DOI 10.1016/S0042-6989(96)00269-6, 10.1016/s0042-6989(96)00269-6]

[5] Mixture of Trees Probabilistic Graphical Model for Video Segmentation [J].

Badrinarayanan, Vijay ;

Budvytis, Ignas ;

Cipolla, Roberto .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 110 (01) :14-29

[6] Label Propagation in Video Sequences [J].

Badrinarayanan, Vijay ;

Galasso, Fabio ;

Cipolla, Roberto .

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :3265-3272

[7]

Bai X, 2010, LECT NOTES COMPUT SC, V6315, P617

[8] ViBe: A Universal Background Subtraction Algorithm for Video Sequences [J].

Barnich, Olivier ;

Van Droogenbroeck, Marc .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (06) :1709-1724

[9]

Brox T, 2010, LECT NOTES COMPUT SC, V6315, P282, DOI 10.1007/978-3-642-15555-0_21

[10]

Budvytis I., 2011, 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2257, DOI 10.1109/CVPR.2011.5995600

← 1 2 3 4 5 6 →