Boosting image classification through semantic attention filtering strategies

Cited by: 16
Authors
Fidalgo, Eduardo [1, 3]
Alegre, Enrique [1, 3]
Gonzalez-Castro, Victor [1, 3]
Fernandez-Robles, Laura [2, 3]
Affiliations
[1] Univ Leon, Dept Ingn Elect & Sistemas & Automat, Leon, Spain
[2] Univ Leon, Dept Ingn Mecan Informat & Aeroespacial, Leon, Spain
[3] INCIBE Spanish Natl Cybersecur Inst, Leon, Spain
Keywords
Saliency map; Bag of words; Mean shift; Support vector machine; Image classification;
DOI
10.1016/j.patrec.2018.06.033
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Saliency maps, frequently used to highlight significant information, can be combined with other paradigms, such as Bag of Visual Words (BoVW), to improve image description when the salient regions correspond closely to the objects of interest. In this paper, we present three attention filtering strategies based on saliency maps that improve image classification using the BoVW framework, Spatial Pyramid Matching (SPM) and Convolutional Neural Network (CNN) features. First, we demonstrate how the blurring factor used in Hou's image signature algorithm determines what information remains and how it affects the accuracy obtained in image classification. Next, we propose AutoBlur, a simple but effective approach to select this factor automatically. Then, based on AutoBlur, we introduce two variants of our SARF (Semantic Attention Region Filtering) approach, which semantically remove non-relevant regions by means of a Mean Shift segmentation. The first variant is based on the intersection of the attention areas from Hou's image signature with the Mean Shift segmentation of the image, while the second discards regions using a key point voting system that relies on the Euclidean distance. The experiments show that the proposed Semantic Attention Filtering methods can be used successfully with BoVW, SPM and CNN features in most of the evaluated situations. Across the five datasets assessed, all three proposed methods outperform the baseline in almost every case when using BoVW. For Spatial Pyramid Matching, the behaviour is similar: the baseline is superior to our proposals in only one of the datasets. In the case of CNNs, our filtering proposal outperforms the baseline in two datasets and performs very similarly to it in the others. (c) 2018 Elsevier B.V. All rights reserved.
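The role of the blurring factor in an image-signature saliency map can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it follows the commonly described form of the image signature (sign of the DCT, inverse DCT, squaring, Gaussian smoothing), and the names `sigma`, `threshold`, `signature_saliency` and `attention_mask` are hypothetical. The abstract does not describe how AutoBlur selects the factor or how the SARF variants are computed, so neither is reproduced here.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n matrix (rows are basis vectors)."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    basis = np.cos(np.pi * (2 * i + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    basis[0, :] = np.sqrt(1.0 / n)
    return basis


def gaussian_kernel(sigma):
    """1-D Gaussian kernel truncated at about three standard deviations."""
    radius = max(1, int(3 * sigma + 0.5))
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return kernel / kernel.sum()


def blur(image, sigma):
    """Separable Gaussian blur with edge padding; output has the input shape."""
    kernel = gaussian_kernel(sigma)
    radius = len(kernel) // 2
    padded = np.pad(image, radius, mode="edge")
    rows = np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="valid"), 1, padded)
    return np.apply_along_axis(
        lambda c: np.convolve(c, kernel, mode="valid"), 0, rows)


def signature_saliency(gray, sigma):
    """Saliency map from the sign of the 2-D DCT, smoothed by `sigma`.

    `sigma` stands in for the blurring factor discussed in the abstract
    (an assumption: the paper's exact parameterisation may differ).
    """
    h, w = gray.shape
    dh, dw = dct_matrix(h), dct_matrix(w)
    coeffs = dh @ gray @ dw.T              # forward 2-D DCT
    recon = dh.T @ np.sign(coeffs) @ dw    # inverse DCT of the signature
    saliency = blur(recon * recon, sigma)
    peak = saliency.max()
    return saliency / peak if peak > 0 else saliency


def attention_mask(saliency, threshold=0.5):
    """Binary attention area; the threshold value is a hypothetical choice."""
    return saliency >= threshold


# Toy image: a bright square on an empty background.
image = np.zeros((32, 32))
image[12:20, 12:20] = 1.0
mask_small_blur = attention_mask(signature_saliency(image, sigma=1.0))
mask_large_blur = attention_mask(signature_saliency(image, sigma=4.0))
```

Comparing `mask_small_blur` and `mask_large_blur` shows how the retained area changes with the blurring factor, which is the dependence the abstract's first contribution examines.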
Pages: 176 - 183
Number of pages: 8
Related papers
50 in total
  • [21] Nonlocal spatial attention module for image classification
    Chen, Bingling
    Huang, Yan
    Xia, Qiaoqiao
    Zhang, Qinglin
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (05)
  • [22] Wavelet-Attention CNN for image classification
    Zhao, Xiangyu
    Huang, Peng
    Shu, Xiangbo
    MULTIMEDIA SYSTEMS, 2022, 28 (03) : 915 - 924
  • [23] DeepTree: Pathological Image Classification Through Imitating Tree-Like Strategies of Pathologists
    Li, Jiawen
    Cheng, Junru
    Meng, Lingqin
    Yan, Hui
    He, Yonghong
    Shi, Huijuan
    Guan, Tian
    Han, Anjia
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (04) : 1501 - 1512
  • [24] DEEP BOOSTING: LAYERED FEATURE MINING FOR GENERAL IMAGE CLASSIFICATION
    Peng, Zhanglin
    Lin, Liang
    Zhang, Ruimao
    Xu, Jing
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
  • [25] Hardware Accelerator for Boosting Convolution Computation in Image Classification Applications
    Chang, Meng-Chou
    Pan, Ze-Gang
    Chen, Jyun-Liang
    2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017,
  • [26] GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval
    Li, Wenhao
    Zhu, Hongqing
    Yang, Suyi
    Wang, Pengyu
    Zhang, Han
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (23) : 21387 - 21401
  • [27] Adaptive Filtering Techniques for Improving Hyperspectral Image Classification
    Amorim, Paulo
    Moraes, Thiago
    Silva, Jorge
    Pedrini, Helio
    NEW ADVANCES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2016, 444 : 889 - 898
  • [28] Semantic-Aware Triplet Loss for Image Classification
    Wang, Guangzhi
    Guo, Yangyang
    Xu, Ziwei
    Wong, Yongkang
    Kankanhalli, Mohan S.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4563 - 4572
  • [29] Multiclass Unlearning for Image Classification via Weight Filtering
    Poppi, Samuele
    Sarto, Sara
    Cornia, Marcella
    Baraldi, Lorenzo
    Cucchiara, Rita
    IEEE INTELLIGENT SYSTEMS, 2024, 39 (06) : 40 - 47
  • [30] Multifeature Analysis and Semantic Context Learning for Image Classification
    Zhang, Qianni
    Izquierdo, Ebroul
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2013, 9 (02)