Multiple spatial pooling for visual object recognition

被引:12
|
作者
Huang, Yongzhen [1 ]
Wu, Zifeng [1 ]
Wang, Liang [1 ]
Song, Chunfeng [1 ]
机构
[1] Chinese Acad Sci CASIA, Inst Automat, NLPR, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Object classification; Spatial modeling; Multiple pooling; SPARSE REPRESENTATION; FEATURES; MANIFOLD;
D O I
10.1016/j.neucom.2013.09.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Global spatial structure is an important factor for visual object recognition but has not attracted sufficient attention in recent studies. Especially, the problems of features' ambiguity and sensitivity to location change in the image space are not yet well solved. In this paper, we propose multiple spatial pooling (MSP) to address these problems. MSP models global spatial structure with multiple Gaussian distributions and then pools features according to the relations between features and Gaussian distributions. Such a process is further generalized into a unified framework, which formulates multiple pooling using matrix operation with structured sparsity. Experiments in terms of scene classification and object categorization demonstrate that MSP can enhance traditional algorithms with small extra computational cost. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:225 / 231
页数:7
相关论文
共 50 条
  • [1] Visual object recognition in multiple sclerosis
    Laatu, S
    Revonsuo, A
    Hämäläinen, P
    Ojanen, V
    Ruutiainen, J
    JOURNAL OF THE NEUROLOGICAL SCIENCES, 2001, 185 (02) : 77 - 88
  • [2] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    COMPUTER VISION - ECCV 2014, PT III, 2014, 8691 : 346 - 361
  • [3] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1904 - 1916
  • [4] The role of spatial attention in visual object recognition
    Shyi, GCW
    Cheng, SK
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 4841 - 4841
  • [5] Multiple Visual Object Recognition For Poster Detection
    Kuzhan, Abdullah
    Ozden, Kemal Egemen
    ICECCO'12: 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION, 2012, : 301 - 304
  • [6] Indoor Scene Recognition With a Visual Attention-Driven Spatial Pooling Strategy
    Elguebaly, Tarek
    Bouguila, Nizar
    2014 CANADIAN CONFERENCE ON COMPUTER AND ROBOT VISION (CRV), 2014, : 268 - 275
  • [7] Multiple Kernel Learning for Visual Object Recognition: A Review
    Bucak, Serhat S.
    Jin, Rong
    Jain, Anil K.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (07) : 1354 - 1369
  • [8] Bayes pooling of visual phrases for object retrieval
    Jiang, Wenhui
    Zhao, Zhicheng
    Su, Fei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (15) : 9095 - 9119
  • [9] Bayes pooling of visual phrases for object retrieval
    Wenhui Jiang
    Zhicheng Zhao
    Fei Su
    Multimedia Tools and Applications, 2016, 75 : 9095 - 9119
  • [10] The Dorsal Visual Pathway Represents Object-Centered Spatial Relations for Object Recognition
    Ayzenberg, Vladislav
    Behrmann, Marlene
    JOURNAL OF NEUROSCIENCE, 2022, 42 (23): : 4693 - 4710