A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition

被引：0

作者：

Dakshayani Himabindu D. ^{[1
,2
]}

Praveen Kumar S. ^{[1
]}

机构：

[1] Department of CSE, GIT, GITAM University

[2] Department of IT, VNRVJIET

来源：

Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in) | 1600年 / Brno University of Technology卷 / 27期

关键词：

Channel Attention; Deep Learning; Fine-Grained Visual Recognition; Image Classification; Spatial Attention; Visual Attention;

D O I：

10.13164/mendel.2021.2.059

中图分类号：

学科分类号：

摘要：

In the recent advancements attention mechanism in deep learning had played a vital role in proving better results in tasks under computer vision. There exists multiple kinds of works under attention mechanism which includes under image classification, fine-grained visual recognition, image captioning, video captioning, object detection and recognition tasks. Global and local attention are the two attention based mechanisms which helps in interpreting the attentive partial. Considering this criteria, there exists channel and spatial attention where in channel attention considers the most attentive channel among the produced block of channels and spatial attention considers which region among the space needs to be focused on. We have proposed a streamlined attention block module which helps in enhancing the feature based learning with less number of additional layers i.e., a GAP layer followed by a linear layer with an incorporation of second order pooling (GSoP) after every layer in the utilized encoder. This mechanism has produced better range dependencies by the conducted experimentation. We have experimented our model on CIFAR-10, CIFAR-100 and FGVC-Aircrafts datasets considering finegrained visual recognition. We were successful in achieving state-of-the-result for FGVC-Aircrafts with an accuracy of 97%. © 2021, Brno University of Technology. All rights reserved.

引用

页码：59 / 67

页数：8

共 50 条

[31] Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition
Sun, Ming
Yuan, Yuchen
Zhou, Feng
Ding, Errui
COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 834 - 850
[32] ACANet: A Fine-grained Image Classification Optimization Method Based on Convolution and Attention Fusion
Tan, Zhi
Xu, Zi-Hao
Journal of Computers (Taiwan), 2024, 35 (01) : 17 - 31
[33] A sparse focus framework for visual fine-grained classification
YongXiong Wang
Guangjun Li
Li Ma
Multimedia Tools and Applications, 2021, 80 : 25271 - 25289
[34] A sparse focus framework for visual fine-grained classification
Wang, YongXiong
Li, Guangjun
Ma, Li
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) : 25271 - 25289
[35] Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition
Yu, Chaojian
Zhao, Xinyi
Zheng, Qi
Zhang, Peng
You, Xinge
COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 595 - 610
[36] Text-Embedded Bilinear Model for Fine-Grained Visual Recognition
Sun, Liang
Guan, Xiang
Yang, Yang
Zhang, Lei
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 211 - 219
[37] DEEP DICTIONARY LEARNING FOR FINE-GRAINED IMAGE CLASSIFICATION
Srinivas, M.
Lin, Yen-Yu
Liao, Hong-Yuan Mark
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 835 - 839
[38] Separated smooth sampling for fine-grained image classification
Rong, Shenghai
Wang, Zilei
Wang, Jie
NEUROCOMPUTING, 2021, 461 : 350 - 359
[39] Improving Fine-Grained Image Classification With Multimodal Information
Xu, Jie
Zhang, Xiaoqian
Zhao, Changming
Geng, Zili
Feng, Yuren
Miao, Ke
Li, Yunji
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2082 - 2095
[40] Robust fine-grained image classification with noisy labels
Tan, Xinxing
Dong, Zemin
Zhao, Hualing
VISUAL COMPUTER, 2022, 39 (11) : 5637 - 5650

← 1 2 3 4 5 →