Adaptive scene dependent filters for segmentation and online learning of visual objects

被引:4
作者
Steil, J. J.
Goetting, M.
Wersing, H.
Koerner, E.
Ritter, H.
机构
[1] Univ Bielefeld, Fac Technol, Neuroinformat Grp, D-33501 Bielefeld, Germany
[2] Honda Res Inst GmbH, D-63073 Offenbach, Germany
关键词
visual online learning; unsupervised image segmentation; vector quantization; cognitive vision; object recognition; human-machine interaction;
D O I
10.1016/j.neucom.2006.11.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose the adaptive scene dependent filter (ASDF) hierarchy for unsupervised learning of image segmentation, which integrates several processing pathways into a flexible, highly dynamic, and real-time capable vision architecture. It is based on forming a combined feature space from basic feature maps like, color, disparity, and pixel position. To guarantee real-time performance, we apply an enhanced vector quantization method to partition this feature space. The learned codebook defines corresponding best-match segments for each prototype and yields an over-segmentation of the object and the surround. The segments are recombined into a final object segmentation mask based on a relevance map, which encodes a coarse bottom-up hypothesis where the object is located in the image. We apply the ASDF hierarchy for preprocessing input images in a feature-based biologically motivated object recognition learning architecture and show experiments with this real-time vision system running at 6 Hz including the online learning of the segmentation. Because interaction with user is not perfect, the real-world system acquires useful views effectively only at about 1.5 Hz, but we show that for training a new object one hundred views taking only one minute of interaction time is sufficient. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1235 / 1246
页数:12
相关论文
共 33 条
  • [1] BORENSTEIN E, 2004, CVPRW, V4, P46
  • [2] Object segmentation by top-down processes
    Bravo, MJ
    Farid, H
    [J]. VISUAL COGNITION, 2003, 10 (04) : 471 - 491
  • [3] Breazeal C, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P1146
  • [4] Image coding using transform vector quantization with training set synthesis
    Comaniciu, D
    Grisel, R
    [J]. SIGNAL PROCESSING, 2002, 82 (11) : 1649 - 1663
  • [5] Color clustering and learning for image segmentation based on neural networks
    Dong, G
    Xie, M
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (04): : 925 - 936
  • [6] DRISCOLL J, 1998, P IEEE RSJ INT C INT
  • [7] Fritsch J, 2002, IEEE ROMAN 2002, PROCEEDINGS, P337, DOI 10.1109/ROMAN.2002.1045645
  • [8] Fritzke B., 1995, ADV NEURAL INFORMATI, V7, P625
  • [9] GOERICK C, 2005, P IEEE HUMANOIDS
  • [10] Heidemann G., 2005, Pattern Recognition and Image Analysis, V15, P55