Boosting image classification through semantic attention filtering strategies

Cited by: 16
Authors
Fidalgo, Eduardo [1, 3]
Alegre, Enrique [1, 3]
Gonzalez-Castro, Victor [1, 3]
Fernandez-Robles, Laura [2, 3]
Affiliations
[1] Univ Leon, Dept Ingn Elect & Sistemas & Automat, Leon, Spain
[2] Univ Leon, Dept Ingn Mecan Informat & Aeroespacial, Leon, Spain
[3] INCIBE Spanish Natl Cybersecur Inst, Leon, Spain
Keywords
Saliency map; Bag of words; Mean shift; Support vector machine; Image classification;
DOI
10.1016/j.patrec.2018.06.033
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Saliency maps, frequently used to highlight significant information, can be combined with other paradigms, such as Bag of Visual Words (BoVW), to improve image description when the salient regions correspond closely to the objects of interest. In this paper, we present three attention filtering strategies based on saliency maps that improve image classification using the BoVW framework, Spatial Pyramid Matching (SPM) and Convolutional Neural Network (CNN) features. First, we demonstrate how the blurring factor used in Hou's image signature algorithm determines what information remains and affects the resulting image classification accuracy. Next, we propose AutoBlur, a simple but effective approach to select this factor automatically. Then, based on AutoBlur, we introduce two variants of our approach SARF (Semantic Attention Region Filtering), which semantically remove non-relevant regions through a Mean Shift segmentation. The first is based on the intersection of the attention areas of Hou's image signature with the Mean Shift segmentation of the image, while the second discards regions using a key point voting system that relies on the Euclidean distance. The experiments show that the proposed Semantic Attention Filtering methods can be used successfully with BoVW, SPM and CNNs in most of the evaluated situations. On the five datasets assessed, all three proposed methods outperform the baseline with BoVW in almost every case. For Spatial Pyramid Matching, the behaviour is similar: the baseline is superior to our proposals on only one of the datasets. In the case of CNNs, our filtering proposal outperforms the baseline on two datasets and performs very similarly to it in the other cases. © 2018 Elsevier B.V. All rights reserved.
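To make the role of the blurring factor concrete, the following is a minimal sketch of an image-signature saliency map in the spirit of Hou's algorithm: take the sign of the image's DCT, invert it, square the result and smooth it with a Gaussian. It is not the authors' implementation. The function names, the grayscale-only input, the min-max normalisation and the fixed threshold in attention_mask are illustrative assumptions, and the sketch does not implement AutoBlur or SARF; sigma here simply plays the part of the blurring factor that AutoBlur is said to select automatically.

```python
import numpy as np
from scipy.fft import dctn, idctn
from scipy.ndimage import gaussian_filter


def image_signature_saliency(gray, sigma):
    """Saliency from the image signature: blur(IDCT(sign(DCT(x))) ** 2).

    gray  : 2-D array holding a grayscale image (assumption: one channel).
    sigma : Gaussian standard deviation in pixels, i.e. the blurring factor.
    Returns a map rescaled to [0, 1] (all zeros if the map is constant).
    """
    gray = np.asarray(gray, dtype=float)
    # The image signature keeps only the sign of each DCT coefficient.
    signature = np.sign(dctn(gray, norm="ortho"))
    # Reconstruct from the signature, square it, then blur.
    reconstruction = idctn(signature, norm="ortho")
    saliency = gaussian_filter(reconstruction ** 2, sigma=sigma)
    span = saliency.max() - saliency.min()
    if span == 0:
        return np.zeros_like(saliency)
    return (saliency - saliency.min()) / span


def attention_mask(saliency, threshold=0.5):
    """Hypothetical helper: keep pixels whose saliency reaches the threshold."""
    return saliency >= threshold


# Usage: a synthetic image with one bright square on a flat background.
image = np.zeros((64, 64))
image[24:40, 24:40] = 1.0
narrow = attention_mask(image_signature_saliency(image, sigma=1.0))
wide = attention_mask(image_signature_saliency(image, sigma=6.0))
# The two masks generally differ: sigma controls how much of the image
# survives the filtering, which is the effect the abstract describes.
```

In a BoVW pipeline, a mask like this could be used to discard key points that fall outside the attention area before building the visual vocabulary; how the paper combines the mask with Mean Shift segments is not specified in the abstract and is left out here.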
Pages: 176-183
Number of pages: 8