Multiscale attention dynamic aware network for fine-grained visual categorization

被引:0
作者
Ou, Jichu [1 ]
Li, Wanyi [2 ]
Huang, Jingmin [2 ]
Huang, Xiaojie [2 ]
Xie, Xuan [2 ]
机构
[1] Guangdong Univ Educ, Sch Math, Guangzhou, Peoples R China
[2] Guangdong Univ Educ, Sch Comp Sci, Guangzhou, Peoples R China
关键词
data mining; image classification; image recognition;
D O I
10.1049/ell2.12696
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Fine-grained visual categorization (FGVC) is a challenging task, facing the issues such as inter-class similarities, large intra-class variances, scale variation, and angle variation. To address these issues, the authors propose a novel multiscale attention dynamic aware network (MADA-Net). The core of network consists of three parallel sub-networks, which learn features from different scales. Each sub-network is composed of three serial sub-modules: (1) A self-attention module (SAM) locates objects according to relative importance scattered throughout feature map. (2) A multiscale feature extractor (MFE) learns the non-linear features of objects. (3) A dynamic aware module (DAM) enhances the learning capability of spatial deformation of the network to generate high-quality feature map. In addition, the authors propose a multiscale adjusted loss (MA-Loss) to improve the performance of network. Experiments on three prevailing benchmark datasets demonstrate that our method can achieve state-of-the-art performance.
引用
收藏
页数:3
相关论文
共 19 条
  • [1] The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification
    Chang, Dongliang
    Ding, Yifeng
    Xie, Jiyang
    Bhunia, Ayan Kumar
    Li, Xiaoxu
    Ma, Zhanyu
    Wu, Ming
    Guo, Jun
    Song, Yi-Zhe
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 4683 - 4695
  • [2] Dai JY, 2017, IEEE ICC
  • [3] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [4] AP-CNN: Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification
    Ding, Yifeng
    Ma, Zhanyu
    Wen, Shaoguo
    Xie, Jiyang
    Chang, Dongliang
    Si, Zhongwei
    Wu, Ming
    Ling, Haibin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2826 - 2836
  • [5] Hu, 2021, 29 ACM INT C MULT
  • [6] Huang SL, 2021, AAAI CONF ARTIF INTE, V35, P1628
  • [7] 3D Object Representations for Fine-Grained Categorization
    Krause, Jonathan
    Stark, Michael
    Deng, Jia
    Li Fei-Fei
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 554 - 561
  • [8] Bilinear CNN Models for Fine-grained Visual Recognition
    Lin, Tsung-Yu
    RoyChowdhury, Aruni
    Maji, Subhransu
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1449 - 1457
  • [9] Maji S., 2013, Fine -Grained Visual Classification of Aircraft
  • [10] Multi-Objective Matrix Normalization for Fine-Grained Visual Recognition
    Min, Shaobo
    Yao, Hantao
    Xie, Hongtao
    Zha, Zheng-Jun
    Zhang, Yongdong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 4996 - 5009