Multiscale attention dynamic aware network for fine-grained visual categorization

Cited by: 0

Authors:

Ou, Jichu [1]

Li, Wanyi [2]

Huang, Jingmin [2]

Huang, Xiaojie [2]

Xie, Xuan [2]

Affiliations:

[1] Guangdong Univ Educ, Sch Math, Guangzhou, Peoples R China

[2] Guangdong Univ Educ, Sch Comp Sci, Guangzhou, Peoples R China

Source:

ELECTRONICS LETTERS | 2023 / Vol. 59 / Issue 01

Keywords:

data mining; image classification; image recognition

DOI:

10.1049/ell2.12696

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]

Discipline classification codes:

0808; 0809

Abstract:

Fine-grained visual categorization (FGVC) is a challenging task that faces issues such as inter-class similarity, large intra-class variance, scale variation, and angle variation. To address these issues, the authors propose a novel multiscale attention dynamic aware network (MADA-Net). The core of the network consists of three parallel sub-networks, which learn features at different scales. Each sub-network is composed of three serial sub-modules: (1) a self-attention module (SAM), which locates objects according to the relative importance distributed across the feature map; (2) a multiscale feature extractor (MFE), which learns the non-linear features of objects; and (3) a dynamic aware module (DAM), which enhances the network's ability to learn spatial deformation so as to generate a high-quality feature map. In addition, the authors propose a multiscale adjusted loss (MA-Loss) to improve the performance of the network. Experiments on three prevailing benchmark datasets demonstrate that the proposed method can achieve state-of-the-art performance.
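
The data flow described in the abstract (three parallel branches, each applying SAM, MFE, and DAM in series) can be sketched roughly as follows. This is only an illustrative wiring diagram in plain Python, not the authors' implementation: the bodies of `sam`, `mfe`, and `dam` are hypothetical placeholders, the `downsample` helper and the scale factors `(1, 2, 4)` are assumptions (the abstract does not say how the scales are produced), and the branches share one set of placeholder stages where the real sub-networks would presumably have separate parameters. The abstract also does not say how branch outputs are fused, so the sketch returns them separately.

```python
from typing import Callable, List, Sequence

Features = List[float]                 # toy 1-D stand-in for a feature map
Stage = Callable[[Features], Features]


def compose(*stages: Stage) -> Stage:
    """Chain stages serially: the output of one stage feeds the next."""
    def run(x: Features) -> Features:
        for stage in stages:
            x = stage(x)
        return x
    return run


def sam(x: Features) -> Features:
    """Placeholder for SAM (assumption): reweight by normalized magnitude."""
    total = sum(abs(v) for v in x) or 1.0
    return [v * abs(v) / total for v in x]


def mfe(x: Features) -> Features:
    """Placeholder for MFE (assumption): a simple ReLU non-linearity."""
    return [max(0.0, v) for v in x]


def dam(x: Features) -> Features:
    """Placeholder for DAM (assumption): identity copy."""
    return list(x)


def downsample(x: Features, factor: int) -> Features:
    """Toy scale change (assumption): keep every `factor`-th element."""
    return x[::factor]


def mada_forward(x: Features, factors: Sequence[int] = (1, 2, 4)) -> List[Features]:
    """Run one SAM -> MFE -> DAM branch per scale and return all outputs."""
    branch = compose(sam, mfe, dam)
    return [branch(downsample(x, f)) for f in factors]


if __name__ == "__main__":
    for scale_output in mada_forward([1.0, -2.0, 3.0, -4.0]):
        print(scale_output)
```

The `compose` helper makes the serial ordering inside each branch explicit, while the list comprehension in `mada_forward` stands in for the parallel sub-networks.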


Pages: 3
