WEIGHTED BAG OF VISUAL WORDS FOR OBJECT RECOGNITION

被引：0

作者：

San Biagio, Marco ^{[1
]}

Bazzani, Loris ^{[1
,2
]}

Cristani, Marco ^{[1
,2
]}

Murino, Vittorio ^{[1
,2
]}

机构：

[1] Ist Italiano Tecnol, Pattern Anal & Comp Vis, Via Morego 30, I-16163 Genoa, Italy

[2] Univ Verona, Dept Informat, I-37134 Verona, Italy

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2014年

关键词：

object recognition; dictionary learning; visual saliency; feature weighting;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Bag of Visual words (BoV) is one of the most successful strategy for object recognition, used to represent an image as a vector of counts using a learned vocabulary. This strategy assumes that the representation is built using patches that are either densely extracted or sampled from the images using feature detectors. However, the dense strategy captures also the noisy background information, whereas the feature detection strategy can lose important parts of the objects. In this paper we propose a solution in-between these two strategies, by densely extracting patches from the image, and weighting them accordingly to their salience. Intuitively, highly salient patches have an important role in describing an object, while those with low saliency are still taken with low emphasis, instead of discarding them. We embed this idea in the word encoding mechanism adopted in the BoV approaches. The technique is successfully applied to vector quantization and Fisher vector, on Caltech-101 and Caltech-256.

引用

页码：2734 / 2738

页数：5

共 50 条

[41] Bag of ARSRG Words (BoAW)
Manzo, Mario
Pellino, Simone
MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2019, 1 (03):
[42] Visual Predictive Architecture for Biologically Inspired Object Recognition
Malowany, Dan
Guterman, Hugo
2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
[43] Cognitive Semantic Model for Visual Object Recognition in Image
Tan, Sieow Yeek
Lukose, Dickson
MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 67 - 78
[44] Distinct but related abilities for visual and haptic object recognition
Chow, Jason K.
Palmeri, Thomas J.
Gauthier, Isabel
PSYCHONOMIC BULLETIN & REVIEW, 2024, 31 (05) : 2148 - 2159
[45] VFM: Visual Feedback Model for Robust Object Recognition
Wang, Chong
Huang, Kai-Qi
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (02) : 325 - 339
[46] VFM: Visual Feedback Model for Robust Object Recognition
Chong Wang
Kai-Qi Huang
Journal of Computer Science and Technology, 2015, 30 : 325 - 339
[47] Learning the Compositional Nature of Visual Object Categories for Recognition
Ommer, Bjoern
Buhmann, Joachim M.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (03) : 501 - 516
[48] Structure preserving dimensionality reduction for visual object recognition
Song, Jinjoo
Yoon, Gangjoon
Cho, Heeryon
Yoon, Sang Min
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (18) : 23529 - 23545
[49] Structure preserving dimensionality reduction for visual object recognition
Jinjoo Song
Gangjoon Yoon
Heeryon Cho
Sang Min Yoon
Multimedia Tools and Applications, 2018, 77 : 23529 - 23545
[50] A VISUAL OBJECT RECOGNITION SYSTEM INVARIANT TO SCALE AND ROTATION
Sato, Yasuomi D.
Jitsev, Jenia
von der Malsburg, Christoph
NEURAL NETWORK WORLD, 2009, 19 (05) : 529 - 544

← 1 2 3 4 5 →