Detecting, segmenting and tracking unknown objects using multi-label MRF inference

Cited by: 34
Authors
Bjorkman, Marten [1]
Bergstrom, Niklas [1]
Kragic, Danica [1]
Affiliations
[1] Royal Inst Technol KTH, Sch Comp Sci & Commun CSC, Comp Vis & Act Percept Lab CVAP, Stockholm, Sweden
Keywords
Figure-ground segmentation; Active perception; MRF; Multi-object tracking; Object detection; GPU acceleration; ENERGY MINIMIZATION; VISUAL-ATTENTION; ACTIVE CONTOURS; STATISTICAL-ANALYSIS; BELIEF PROPAGATION; MODEL; SEGMENTATION; VISION;
DOI
10.1016/j.cviu.2013.10.007
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This article presents a unified framework for detecting, segmenting and tracking unknown objects in everyday scenes, allowing for inspection of object hypotheses during interaction over time. A heterogeneous scene representation is proposed, with background regions modeled as a combination of planar surfaces and uniform clutter, and foreground objects as 3D ellipsoids. Recent energy minimization methods based on loopy belief propagation, tree-reweighted message passing and graph cuts are studied for the purpose of multi-object segmentation and benchmarked in terms of segmentation quality, as well as computational speed and how easily methods can be adapted for parallel processing. One conclusion is that the choice of energy minimization method is less important than the way scenes are modeled. Proximities are more valuable for segmentation than similarity in colors, while the benefit of 3D information is limited. It is also shown through practical experiments that, with implementations on GPUs, multi-object segmentation and tracking using state-of-the-art MRF inference methods is feasible, despite the computational costs typically associated with such methods. © 2013 Published by Elsevier Inc.
Pages: 111-127
Page count: 17