Weakly Supervised Learning of Visual Models and Its Application to Content-Based Retrieval

被引：0

作者：

Cordelia Schmid

机构：

[1] INRIA Rhône-Alpes,

来源：

International Journal of Computer Vision | 2004年 / 56卷

关键词：

visual model; two-layer image description; weakly supervised learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper presents a method for weakly supervised learning of visual models. The visual model is based on a two-layer image description: a set of “generic” descriptors and their distribution over neighbourhoods. “Generic” descriptors represent sets of similar rotational invariant feature vectors. Statistical spatial constraints describe the neighborhood structure and make our description more discriminant. The joint probability of the frequencies of “generic” descriptors over a neighbourhood is multi-modal and is represented by a set of “neighbourhood-frequency” clusters. Our image description is rotationally invariant, robust to model deformations and characterizes efficiently “appearance-based” visual structure. The selection of distinctive clusters determines model features (common to the positive and rare in the negative examples). Visual models are retrieved and localized using a probabilistic score. Experimental results for “textured” animals and faces show a very good performance for retrieval as well as localization.

引用

页码：7 / 16

页数：9

共 27 条

[1] Amit Y.(1999)A computational model for visual selection Neural Computation 11 1691-1715
[2] Geman D.(1997)Performance of phase-based algorithms for disparity estimation Machine Vision and Applications 9 334-340
[3] Cozzi A.(1946)Theory of communication Journal I.E.E. 3 429-457
[4] Crespi B.(1991)Unsupervised texture segmentation using Gabor filters Pattern Recognition 24 1167-1186
[5] Valentinotti F.(1987)Representation of local geometry in the visual system Biological Cybernetics 55 367-375
[6] Worgotter F.(2003)Sparse texture representation using affine-invariant neighborhoods Proceedings of the Conference on Computer Vision and Pattern Recognition II 313-324
[7] Gabor D.(1998)Feature detection with automatic scale selection International Journal of Computer Vision 30 79-116
[8] Jain A.K.(2000)A trainable system for object detection International Journal of Computer Vision 38 15-33
[9] Farrokhnia F.(1999)Texture-based image retrieval without segmentation Proceedings of the 7th International Conference on Computer Vision 2 1018-1024
[10] Koenderink J.J.(2001)Constructing models for content-based image retrieval Proceedings of the Conference on Computer Vision and Pattern Recognition II 39-45

← 1 2 3 →