Boosting image classification through semantic attention filtering strategies

Cited by: 16

Authors:

Fidalgo, Eduardo [1,3]

Alegre, Enrique [1,3]

Gonzalez-Castro, Victor [1,3]

Fernandez-Robles, Laura [2,3]

Affiliations:

[1] Univ Leon, Dept Ingn Elect & Sistemas & Automat, Leon, Spain

[2] Univ Leon, Dept Ingn Mecan Informat & Aeroespacial, Leon, Spain

[3] INCIBE Spanish Natl Cybersecur Inst, Leon, Spain

Source:

PATTERN RECOGNITION LETTERS | 2018 / Volume 112

Keywords:

Saliency map; Bag of words; Mean shift; Support vector machine; Image classification;

DOI:

10.1016/j.patrec.2018.06.033

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Saliency maps, frequently used to highlight significant information, can be combined with other paradigms, such as Bag of Visual Words (BoVW), to improve image description when the salient regions correspond closely with the objects of interest. In this paper, we present three attention filtering strategies based on the saliency map of an image that improve image classification using the BoVW framework, Spatial Pyramid Matching (SPM) and Convolutional Neural Network (CNN) features. Firstly, we demonstrate how the blurring factor used in Hou's image signature algorithm determines what information remains and affects the accuracy obtained in image classification. Next, we propose AutoBlur, a simple but effective approach to select this factor automatically. Then, based on AutoBlur, we introduce two variants of our approach SARF (Semantic Attention Region Filtering), which semantically remove non-relevant regions through a Mean Shift segmentation. The first one is based on the intersection of the attention areas given by Hou's method with the Mean Shift segmentation of the image, while the second one discards regions using a key point voting system that relies on the Euclidean distance. The experiments carried out showed that the proposed Semantic Attention Filtering methods can be used successfully with BoVW, SPM and CNNs in most of the evaluated situations. In the five datasets assessed, all three proposed methods outperform the baseline when using BoVW in almost every case. For Spatial Pyramid Matching, the behaviour is similar: the baseline is superior to our proposals in only one of the datasets used. In the case of CNNs, our filtering proposal outperforms the baseline in two datasets and is very similar to it in the other cases. (c) 2018 Elsevier B.V. All rights reserved.
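As a rough illustration of the blurring factor mentioned in the abstract, the following is a minimal sketch of an image-signature-style saliency map: take the sign of the 2-D DCT, invert it, square the result, and smooth it with a Gaussian whose standard deviation plays the role of the blurring factor. This is not the authors' code. The function names (`dct_matrix`, `gaussian_blur`, `image_signature_saliency`), the parameter `sigma`, and the synthetic test image are all assumptions made for illustration, and the sketch does not implement AutoBlur or SARF, whose details the abstract does not give.

```python
import numpy as np


def dct_matrix(n):
    """Return the orthonormal DCT-II matrix of size n x n."""
    k = np.arange(n)[:, None]  # frequency index (rows)
    i = np.arange(n)[None, :]  # sample index (columns)
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m


def gaussian_blur(img, sigma):
    """Separable Gaussian blur with zero padding at the borders."""
    radius = max(1, int(round(3 * sigma)))
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    kernel /= kernel.sum()
    if kernel.size > min(img.shape):
        raise ValueError("sigma is too large for this image size")
    rows = np.apply_along_axis(np.convolve, 1, img, kernel, mode="same")
    return np.apply_along_axis(np.convolve, 0, rows, kernel, mode="same")


def image_signature_saliency(gray, sigma):
    """Saliency sketch: blur((IDCT(sign(DCT(gray))))^2), scaled to [0, 1].

    `sigma` is a hypothetical name for the blurring factor.
    """
    gray = np.asarray(gray, dtype=float)
    h, w = gray.shape
    dh, dw = dct_matrix(h), dct_matrix(w)
    coeffs = dh @ gray @ dw.T             # forward 2-D DCT
    recon = dh.T @ np.sign(coeffs) @ dw   # inverse 2-D DCT of the sign pattern
    sal = gaussian_blur(recon * recon, sigma)
    peak = sal.max()
    return sal / peak if peak > 0 else sal


# Synthetic example (an assumption, not data from the paper):
# a small bright square on a dark background.
image = np.zeros((64, 64))
image[20:28, 30:38] = 1.0

sharp_map = image_signature_saliency(image, sigma=1.0)   # little smoothing
smooth_map = image_signature_saliency(image, sigma=4.0)  # heavier smoothing
```

Thresholding `sharp_map` and `smooth_map` at the same level would generally keep different regions of the image, which is the kind of dependence on the blurring factor that the abstract refers to.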

Citation

Pages: 176-183

Page count: 8

Related records (50 in total)

[21] Nonlocal spatial attention module for image classification
Chen, Bingling
Huang, Yan
Xia, Qiaoqiao
Zhang, Qinglin
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (05)
[22] Wavelet-Attention CNN for image classification
Zhao, Xiangyu
Huang, Peng
Shu, Xiangbo
MULTIMEDIA SYSTEMS, 2022, 28 (03) : 915 - 924
[23] DeepTree: Pathological Image Classification Through Imitating Tree-Like Strategies of Pathologists
Li, Jiawen
Cheng, Junru
Meng, Lingqin
Yan, Hui
He, Yonghong
Shi, Huijuan
Guan, Tian
Han, Anjia
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2024, 43 (04) : 1501 - 1512
[24] DEEP BOOSTING: LAYERED FEATURE MINING FOR GENERAL IMAGE CLASSIFICATION
Peng, Zhanglin
Lin, Liang
Zhang, Ruimao
Xu, Jing
2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
[25] Hardware Accelerator for Boosting Convolution Computation in Image Classification Applications
Chang, Meng-Chou
Pan, Ze-Gang
Chen, Jyun-Liang
2017 IEEE 6TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2017,
[26] GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval
Li, Wenhao
Zhu, Hongqing
Yang, Suyi
Wang, Pengyu
Zhang, Han
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (23) : 21387 - 21401
[27] Adaptive Filtering Techniques for Improving Hyperspectral Image Classification
Amorim, Paulo
Moraes, Thiago
Silva, Jorge
Pedrini, Helio
NEW ADVANCES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, 2016, 444 : 889 - 898
[28] Semantic-Aware Triplet Loss for Image Classification
Wang, Guangzhi
Guo, Yangyang
Xu, Ziwei
Wong, Yongkang
Kankanhalli, Mohan S.
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4563 - 4572
[29] Multiclass Unlearning for Image Classification via Weight Filtering
Poppi, Samuele
Sarto, Sara
Cornia, Marcella
Baraldi, Lorenzo
Cucchiara, Rita
IEEE INTELLIGENT SYSTEMS, 2024, 39 (06) : 40 - 47
[30] Multifeature Analysis and Semantic Context Learning for Image Classification
Zhang, Qianni
Izquierdo, Ebroul
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2013, 9 (02)
