Adaptive scene dependent filters for segmentation and online learning of visual objects

被引：4

作者：

Steil, J. J.

Goetting, M.

Wersing, H.

Koerner, E.

Ritter, H.

机构：

[1] Univ Bielefeld, Fac Technol, Neuroinformat Grp, D-33501 Bielefeld, Germany

[2] Honda Res Inst GmbH, D-63073 Offenbach, Germany

来源：

NEUROCOMPUTING | 2007年 / 70卷 / 7-9期

关键词：

visual online learning; unsupervised image segmentation; vector quantization; cognitive vision; object recognition; human-machine interaction;

D O I：

10.1016/j.neucom.2006.11.020

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose the adaptive scene dependent filter (ASDF) hierarchy for unsupervised learning of image segmentation, which integrates several processing pathways into a flexible, highly dynamic, and real-time capable vision architecture. It is based on forming a combined feature space from basic feature maps like, color, disparity, and pixel position. To guarantee real-time performance, we apply an enhanced vector quantization method to partition this feature space. The learned codebook defines corresponding best-match segments for each prototype and yields an over-segmentation of the object and the surround. The segments are recombined into a final object segmentation mask based on a relevance map, which encodes a coarse bottom-up hypothesis where the object is located in the image. We apply the ASDF hierarchy for preprocessing input images in a feature-based biologically motivated object recognition learning architecture and show experiments with this real-time vision system running at 6 Hz including the online learning of the segmentation. Because interaction with user is not perfect, the real-world system acquires useful views effectively only at about 1.5 Hz, but we show that for training a new object one hundred views taking only one minute of interaction time is sufficient. (c) 2007 Elsevier B.V. All rights reserved.

引用

页码：1235 / 1246

页数：12

共 33 条

[1] BORENSTEIN E, 2004, CVPRW, V4, P46
[2] Object segmentation by top-down processes
Bravo, MJ
Farid, H
[J]. VISUAL COGNITION, 2003, 10 (04) : 471 - 491
[3] Breazeal C, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P1146
[4] Image coding using transform vector quantization with training set synthesis
Comaniciu, D
Grisel, R
[J]. SIGNAL PROCESSING, 2002, 82 (11) : 1649 - 1663
[5] Color clustering and learning for image segmentation based on neural networks
Dong, G
Xie, M
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (04): : 925 - 936
[6] DRISCOLL J, 1998, P IEEE RSJ INT C INT
[7] Fritsch J, 2002, IEEE ROMAN 2002, PROCEEDINGS, P337, DOI 10.1109/ROMAN.2002.1045645
[8] Fritzke B., 1995, ADV NEURAL INFORMATI, V7, P625
[9] GOERICK C, 2005, P IEEE HUMANOIDS
[10] Heidemann G., 2005, Pattern Recognition and Image Analysis, V15, P55

← 1 2 3 4 →