WEIGHTED BAG OF VISUAL WORDS FOR OBJECT RECOGNITION

Authors
San Biagio, Marco [1]
Bazzani, Loris [1, 2]
Cristani, Marco [1, 2]
Murino, Vittorio [1, 2]
Affiliations
[1] Ist Italiano Tecnol, Pattern Anal & Comp Vis, Via Morego 30, I-16163 Genoa, Italy
[2] Univ Verona, Dept Informat, I-37134 Verona, Italy
Source
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2014
Keywords
object recognition; dictionary learning; visual saliency; feature weighting
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Bag of Visual Words (BoV) is one of the most successful strategies for object recognition; it represents an image as a vector of counts over a learned vocabulary. This strategy assumes that the representation is built from patches that are either densely extracted or sampled from the images using feature detectors. However, the dense strategy also captures noisy background information, whereas the feature detection strategy can miss important parts of the objects. In this paper we propose a solution in between these two strategies: patches are densely extracted from the image and weighted according to their saliency. Intuitively, highly salient patches play an important role in describing an object, while those with low saliency are still included, with low emphasis, rather than discarded. We embed this idea in the word encoding mechanism adopted in BoV approaches. The technique is successfully applied to vector quantization and Fisher vectors on Caltech-101 and Caltech-256.
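The weighting idea in the abstract can be illustrated for the vector quantization case. The sketch below is not the authors' implementation: the abstract does not give the exact formulation, so the function name `weighted_bov_histogram`, the use of hard nearest-word assignment, the L1 normalization, and the assumption that saliency arrives as one non-negative score per patch are all illustrative choices. It shows only the general principle that each densely extracted patch votes for its visual word with a weight equal to its saliency, rather than with a count of one.

```python
import numpy as np


def weighted_bov_histogram(descriptors, codebook, saliency):
    """Saliency-weighted bag-of-visual-words histogram (illustrative sketch).

    descriptors: (N, D) array, one local descriptor per densely sampled patch.
    codebook:    (K, D) array of visual words (e.g. k-means centroids).
    saliency:    (N,) array of non-negative per-patch weights (assumed input).
    """
    descriptors = np.asarray(descriptors, dtype=float)
    codebook = np.asarray(codebook, dtype=float)
    saliency = np.asarray(saliency, dtype=float)

    # Squared Euclidean distance from every patch to every word: shape (N, K).
    diff = descriptors[:, None, :] - codebook[None, :, :]
    dist2 = (diff ** 2).sum(axis=2)

    # Hard assignment: each patch votes for its nearest visual word.
    nearest = dist2.argmin(axis=1)

    # Each vote is weighted by the patch saliency instead of counting as 1.
    hist = np.bincount(nearest, weights=saliency, minlength=codebook.shape[0])

    # L1-normalize so images with different patch counts are comparable
    # (an assumption of this sketch, not stated in the abstract).
    total = hist.sum()
    return hist / total if total > 0 else hist


# Toy data: two visual words, three patches, one of them highly salient.
codebook = np.array([[0.0, 0.0], [1.0, 1.0]])
descriptors = np.array([[0.1, 0.0], [0.9, 1.0], [1.0, 0.8]])
saliency = np.array([0.9, 0.1, 0.1])

hist = weighted_bov_histogram(descriptors, codebook, saliency)
uniform = weighted_bov_histogram(descriptors, codebook, np.ones(3))
```

With uniform weights the function reduces to the standard normalized BoV histogram, so the low-saliency patches are down-weighted rather than discarded, in line with the abstract's description.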
Pages: 2734-2738
Page count: 5