Ventral-stream-like shape representation: from pixel intensity values to trainable object-selective COSFIRE models

被引:17
作者
Azzopardi, George [1 ]
Petkov, Nicolai [1 ]
机构
[1] Univ Groningen, Johann Bernoulli Inst Math & Comp Sci, Intelligent Syst, NL-9700 AV Groningen, Netherlands
关键词
hierarchical representation; object recognition; shape; ventral stream; vision and scene understanding; robotics; handwriting analysis; RECOGNITION; ORGANIZATION; CONTOUR;
D O I
10.3389/fncom.2014.00080
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The remarkable abilities of the primate visual system have inspired the construction of computational models of some visual neurons. We propose a trainable hierarchical object recognition model, which we call S-COSFIRE (S stands for Shape and COSFIRE stands for Combination Of Shifted Filter REsponses) and use it to localize and recognize objects of interests embedded in complex scenes. It is inspired by the visual processing in the ventral stream (V1/V2 -> V4 -> TEO). Recognition and localization of objects embedded in complex scenes is important for many computer vision applications. Most existing methods require prior segmentation of the objects from the background which on its turn requires recognition. An S-COSFIRE filter is automatically configured to be selective for an arrangement of contour-based features that belong to a prototype shape specified by an example. The configuration comprises selecting relevant vertex detectors and determining certain blur and shift parameters. The response is computed as the weighted geometric mean of the blurred and shifted responses of the selected vertex detectors. S-COSFIRE filters share similar properties with some neurons in inferotemporal cortex, which provided inspiration for this work. We demonstrate the effectiveness of S-COSFIRE filters in two applications: letter and keyword spotting in handwritten manuscripts and object spotting in complex scenes for the computer vision system of a domestic robot. S-COSFIRE filters are effective to recognize and localize (deformable) objects in images of complex scenes without requiring prior segmentation. They are versatile trainable shape detectors, conceptually simple and easy to implement. The presented hierarchical shape representation contributes to a better understanding of the brain and to more robust computer vision algorithms.
引用
收藏
页数:9
相关论文
共 39 条
[1]   A non-rigid appearance model for shape description and recognition [J].
Almazan, Jon ;
Fornes, Alicia ;
Valveny, Ernest .
PATTERN RECOGNITION, 2012, 45 (09) :3105-3113
[2]  
[Anonymous], P IEEE C COMP VIS PA
[3]  
[Anonymous], 1982, Visual perception
[4]   A Push-Pull CORF Model of a Simple Cell with Antiphase Inhibition Improves SNR and Contour Detection [J].
Azzopardi, George ;
Rodriguez-Sanchez, Antonio ;
Piater, Justus ;
Petkov, Nicolai .
PLOS ONE, 2014, 9 (07)
[5]   Automatic detection of vascular bifurcations in segmented retinal images using trainable COSFIRE filters [J].
Azzopardi, George ;
Petkov, Nicolai .
PATTERN RECOGNITION LETTERS, 2013, 34 (08) :922-933
[6]   Trainable COSFIRE Filters for Keypoint Detection and Pattern Recognition [J].
Azzopardi, George ;
Petkov, Nicolai .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (02) :490-503
[7]   A CORF computational model of a simple cell that relies on LGN input outperforms the Gabor function model [J].
Azzopardi, George ;
Petkov, Nicolai .
BIOLOGICAL CYBERNETICS, 2012, 106 (03) :177-189
[8]   A SURVEY OF VISION-BASED ARCHITECTURES FOR ROBOT LEARNING BY IMITATION [J].
Bandera, J. P. ;
Rodriguez, J. A. ;
Molina-Tanco, L. ;
Bandera, A. .
INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2012, 9 (01)
[9]   Shape matching and object recognition using shape contexts [J].
Belongie, S ;
Malik, J ;
Puzicha, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522
[10]   Underlying principles of visual shape selectivity in posterior inferotemporal cortex [J].
Brincat, SL ;
Connor, CE .
NATURE NEUROSCIENCE, 2004, 7 (08) :880-886