Boosting image classification through semantic attention filtering strategies

被引：16

作者：

Fidalgo, Eduardo ^{[1
,3
]}

Alegre, Enrique ^{[1
,3
]}

Gonzalez-Castro, Victor ^{[1
,3
]}

Fernandez-Robles, Laura ^{[2
,3
]}

机构：

[1] Univ Leon, Dept Ingn Elect & Sistemas & Automat, Leon, Spain

[2] Univ Leon, Dept Ingn Mecan Informat & Aeroespacial, Leon, Spain

[3] INCIBE Spanish Natl Cybersecur Inst, Leon, Spain

来源：

PATTERN RECOGNITION LETTERS | 2018年 / 112卷

关键词：

Saliency map; Bag of words; Mean shift; Support vector machine; Image classification;

D O I：

10.1016/j.patrec.2018.06.033

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Saliency Maps, frequently used to highlight significant information, can be combined with other paradigms, such as Bag of Visual Words (BoVW), to improve image description when the saliency regions correspond closely with the objects of interest. In this paper, we present three attention filtering strategies based on their saliency map that improve image classification using the BoVW framework, Spatial Pyramid Matching (SPM) and Convolutional Neural Networks (CNN) features. Firstly, we demonstrate how the blurring factor used in the Hou's image signature algorithm determines what information remains and impacts to the obtained accuracy in image classification. Next, we propose AutoBlur, a simple but effective approach to automatically select this factor. Then, based on AutoBlur, we introduce two variants of our approach SARF (Semantic Attention Region Filtering), to semantically remove non-relevant regions through a Mean Shift segmentation. The first one is based on the intersection of the Hou's image attention areas with its Mean Shift segmentation, while the second one discards regions using a key point voting system that relies on the Euclidean distance. The experiments carried out showed that the methods of Semantic Attention Filtering that we are proposing could be successfully used with both BoVW, SPM and CNN's in most of the evaluated situations. In the five datasets assessed, all the three proposed methods outperform the baseline when using BoVWs in almost every case. For Spatial Pyramid Matching, the behaviour is similar, finding that the baseline is superior to our proposals in only one of the datasets used. In the case of CNN's, our filtering proposal outperforms the baseline in two datasets, being very similar to it in the other cases. (c) 2018 Elsevier B.V. All rights reserved.

引用

页码：176 / 183

页数：8

共 50 条

[31] Image classification by search with explicitly and implicitly semantic representations
Zhang, Chunjie
Zhu, Guibo
Huang, Qingming
Tian, Qi
INFORMATION SCIENCES, 2017, 376 : 125 - 135
[32] An Algorithm for Image Classification Based on Semantic Transfer Learning
Du, Tianming
Wang, Xiaoru
Du, Junping
Wang, Yuanyou
ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING: FUTURE INFORMATION TECHNOLOGY, 2015, 352 : 249 - 256
[33] Joint image representation and classification in random semantic spaces
Zhang, Chunjie
Zhu, Xiaobin
Li, Liang
Zhang, Yifan
Liu, Jing
Huang, Qingming
Tian, Qi
NEUROCOMPUTING, 2015, 156 : 79 - 85
[34] Gaussian Mixture Model with Semantic Distance for Image Classification
Wu, Wei
Gao, Guanglai
Nie, Jianyun
26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 1687 - 1691
[35] Personalized Image Classification by Semantic Embedding and Active Learning †
Song, Mofei
ENTROPY, 2020, 22 (11) : 1 - 26
[36] Double Attention for Multi-Label Image Classification
Zhao, Haiying
Zhou, Wei
Hou, Xiaogang
Zhu, Hui
IEEE ACCESS, 2020, 8 : 225539 - 225550
[37] Adaptive hybrid attention network for hyperspectral image classification *
Pande, Shivam
Banerjee, Biplab
PATTERN RECOGNITION LETTERS, 2021, 144 : 6 - 12
[38] Weakly Supervised Image Classification Based on Attention Mechanism
Cheng, Xiaohui
Liu, Pengfei
Chen, Shouxue
PROCEEDINGS OF 2020 IEEE 2ND INTERNATIONAL CONFERENCE ON CIVIL AVIATION SAFETY AND INFORMATION TECHNOLOGY (ICCASIT), 2020, : 630 - 634
[39] Multiscale attention for few-shot image classification
Zhou, Tong
Dong, Changyin
Song, Junshu
Zhang, Zhiqiang
Wang, Zhen
Chang, Bo
Chen, Dechun
COMPUTATIONAL INTELLIGENCE, 2024, 40 (02)
[40] The Latent Semantic Power of Labels: Improving Image Classification via Natural Language Semantic
Jia, Haosen
Yao, Hong
Tian, Tian
Yan, Cheng
Li, Shengwen
HUMAN CENTERED COMPUTING, 2019, 11956 : 175 - 189

← 1 2 3 4 5 →