Learning more discriminative clues with gradual attention for fine-grained visual categorization

被引：1

作者：

Xu, Qin ^{[1
,2
]}

Zhang, Mengquan ^{[1
,2
]}

Li, Yun ^{[1
,2
]}

Tao, Zhifu ^{[3
]}

机构：

[1] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China

[2] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China

[3] Anhui Univ, Sch Big Data & Stat, Hefei 230601, Peoples R China

来源：

IMAGE AND VISION COMPUTING | 2023年 / 136卷

基金：

中国国家自然科学基金;

关键词：

Fine-grained visual categorization; Convolutional neural network; Visual attention; Self -calibrated convolution; IMAGE CLASSIFICATION; NETWORK; MODEL; CNN;

D O I：

10.1016/j.imavis.2023.104753

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Fine-grained visual categorization, which aims to identify the different subcategories of images within the same category, is a very challenging task due to the large intra-class differences and subtle inter-class variances. The existing methods mostly focus on the salient local regions and ignore other features which probably help to recognize the images more precisely. To address this issue, in this paper, we propose a novel end-to-end network composed of the self-calibrated convolution, gradual attention module and feature inverse module for fine-grained visual categorization. To extract the salient features, the self-calibrated convolution is exploited which can avoid the influence of irrelevant information and locate salient regions more accurately. In aiming to extract the discriminative features, we propose the gradual attention module which consists of alternate channel-spatial attention and hierarchical feature grouping. The gradual attention module can extract the subtle discriminative features gradually even when the semantic information of shallow stages is not rich. Moreover, we design the feature inverse module which forces the next stage of network to search for other different useful features by feature inverse. The gradual attention module combined with the feature inverse module is capable of finding more detailed regions and of benefit to improving classification performance. Finally, the stage features and fused features are jointly used for classification. The proposed method is evaluated on three classical fine-grained image datasets and compared with a number of state-of-the-art methods. Our method achieves 89.5%, 95.2% and 93.9% accuracies on CUB-200-2011, Stanford Cars and FGVC-Aircraft datasets respectively. The experimental results demonstrate the effectiveness and superiority of the proposed method.

引用

页数：14

共 50 条

[41] A collaborative gated attention network for fine-grained visual classification
Zhu, Qiangxi
Kuang, Wenlan
Li, Zhixin
DISPLAYS, 2023, 79
[42] Diversified Visual Attention Networks for Fine-Grained Object Classification
Zhao, Bo
Wu, Xiao
Feng, Jiashi
Peng, Qiang
Yan, Shuicheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (06) : 1245 - 1256
[43] A Novel Smart Lightweight Visual Attention Model for Fine-Grained Vehicle Recognition
Boukerche, Azzedine
Ma, Xiren
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 13846 - 13862
[44] Fine-Grained Visual Categorization Using Meta-learning Optimization with Sample Selection of Auxiliary Data
Zhang, Yabin
Tang, Hui
Jia, Kui
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 241 - 256
[45] Fine-Grained Visual-Textual Representation Learning
He, Xiangteng
Peng, Yuxin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (02) : 520 - 531
[46] Posture-guided part learning for fine-grained image categorization
Song, Wei
Chen, Dongmei
JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33013
[47] Hierarchical deep transfer learning for fine-grained categorization on micro datasets
Wang, Ronggui
Yao, Xuchen
Yang, Juan
Xue, Lixia
Hu, Min
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 129 - 139
[48] Searching and Learning Discriminative Regions for Fine-Grained Image Retrieval and Classification
Sun, Kangbo
Zhu, Jie
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (01) : 141 - 149
[49] The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking
Ye, Shuo
Wang, Yu
Peng, Qinmu
You, Xinge
Chen, C. L. Philip
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 2 - 16
[50] Global Information-Assisted Fine-Grained Visual Categorization in Internet of Things
Li, Ang
Kang, Bin
Chen, Jianxin
Wu, Dan
Zhou, Liang
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (01) : 940 - 952

← 1 2 3 4 5 →