A Streamlined Attention Mechanism for Image Classification and Fine-Grained Visual Recognition

被引:0
|
作者
Dakshayani Himabindu D. [1 ,2 ]
Praveen Kumar S. [1 ]
机构
[1] Department of CSE, GIT, GITAM University
[2] Department of IT, VNRVJIET
来源
Dakshayani Himabindu, D. (dakshayanihimabindu_d@vnrvjiet.in) | 1600年 / Brno University of Technology卷 / 27期
关键词
Channel Attention; Deep Learning; Fine-Grained Visual Recognition; Image Classification; Spatial Attention; Visual Attention;
D O I
10.13164/mendel.2021.2.059
中图分类号
学科分类号
摘要
In the recent advancements attention mechanism in deep learning had played a vital role in proving better results in tasks under computer vision. There exists multiple kinds of works under attention mechanism which includes under image classification, fine-grained visual recognition, image captioning, video captioning, object detection and recognition tasks. Global and local attention are the two attention based mechanisms which helps in interpreting the attentive partial. Considering this criteria, there exists channel and spatial attention where in channel attention considers the most attentive channel among the produced block of channels and spatial attention considers which region among the space needs to be focused on. We have proposed a streamlined attention block module which helps in enhancing the feature based learning with less number of additional layers i.e., a GAP layer followed by a linear layer with an incorporation of second order pooling (GSoP) after every layer in the utilized encoder. This mechanism has produced better range dependencies by the conducted experimentation. We have experimented our model on CIFAR-10, CIFAR-100 and FGVC-Aircrafts datasets considering finegrained visual recognition. We were successful in achieving state-of-the-result for FGVC-Aircrafts with an accuracy of 97%. © 2021, Brno University of Technology. All rights reserved.
引用
收藏
页码:59 / 67
页数:8
相关论文
共 50 条
  • [31] Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition
    Sun, Ming
    Yuan, Yuchen
    Zhou, Feng
    Ding, Errui
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 834 - 850
  • [32] ACANet: A Fine-grained Image Classification Optimization Method Based on Convolution and Attention Fusion
    Tan, Zhi
    Xu, Zi-Hao
    Journal of Computers (Taiwan), 2024, 35 (01) : 17 - 31
  • [33] A sparse focus framework for visual fine-grained classification
    YongXiong Wang
    Guangjun Li
    Li Ma
    Multimedia Tools and Applications, 2021, 80 : 25271 - 25289
  • [34] A sparse focus framework for visual fine-grained classification
    Wang, YongXiong
    Li, Guangjun
    Ma, Li
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) : 25271 - 25289
  • [35] Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition
    Yu, Chaojian
    Zhao, Xinyi
    Zheng, Qi
    Zhang, Peng
    You, Xinge
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 595 - 610
  • [36] Text-Embedded Bilinear Model for Fine-Grained Visual Recognition
    Sun, Liang
    Guan, Xiang
    Yang, Yang
    Zhang, Lei
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 211 - 219
  • [37] DEEP DICTIONARY LEARNING FOR FINE-GRAINED IMAGE CLASSIFICATION
    Srinivas, M.
    Lin, Yen-Yu
    Liao, Hong-Yuan Mark
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 835 - 839
  • [38] Separated smooth sampling for fine-grained image classification
    Rong, Shenghai
    Wang, Zilei
    Wang, Jie
    NEUROCOMPUTING, 2021, 461 : 350 - 359
  • [39] Improving Fine-Grained Image Classification With Multimodal Information
    Xu, Jie
    Zhang, Xiaoqian
    Zhao, Changming
    Geng, Zili
    Feng, Yuren
    Miao, Ke
    Li, Yunji
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2082 - 2095
  • [40] Robust fine-grained image classification with noisy labels
    Tan, Xinxing
    Dong, Zemin
    Zhao, Hualing
    VISUAL COMPUTER, 2022, 39 (11) : 5637 - 5650