Improved fine-grained image classification in few-shot learning based on channel-spatial attention and grouped bilinear convolution

被引:2
作者
Zeng, Ziwei [1 ]
Li, Lihong [1 ]
Zhao, Zoufei [1 ]
Liu, Qingqing [1 ]
机构
[1] Hebei Univ Engn, Sch Informat & Elect Engn, Handan 056038, Hebei, Peoples R China
关键词
Fine-grained image classification; Grouped bilinear convolution; Few-shot; Channel-spatial interaction weighting; NETWORK;
D O I
10.1007/s00371-024-03650-6
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the context of the complexities of fine-grained image classification intertwined with the constraints of few-shot learning, this paper focuses on overcoming the challenges posed by subtle inter-class differences. To enhance the model's capability to recognize key visual patterns, such as eyes and beaks, this research ingeniously integrates spatial and channel attention mechanisms along with grouped bilinear convolution techniques to adapt to the few-shot learning environment. Specifically, a novel neural network architecture is designed that integrates channel and spatial information, and interactively applies these two types of information to collaboratively optimize the weights of channel and spatial attention. Additionally, to further explore the complex dependencies among features, a grouped bilinear convolution strategy is introduced. This algorithm divides the weighted feature maps into multiple independent groups, where bilinear operations are performed within each group. This strategy captures higher-order feature interactions while reducing network parameters. Comprehensive experiments conducted on three fine-grained benchmark datasets for two few-shot tasks demonstrate the superiority of our algorithm in handling fine-grained features. Notably, in the experiments on the Stanford Cars dataset, a classification accuracy of 95.42% was achieved, confirming its effectiveness and applicability in few-shot learning scenarios. Codes are available at: https://github.com/204503zzw/atb.
引用
收藏
页码:4129 / 4141
页数:13
相关论文
共 51 条
[1]  
Chen W., 2019, 7 INT C LEARN REPR I
[2]   Class attention network for image recognition [J].
Cheng, Gong ;
Lai, Pujian ;
Gao, Decheng ;
Han, Junwei .
SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (03)
[3]   Disentangled Feature Representation for Few-Shot Image Classification [J].
Cheng, Hao ;
Wang, Yufei ;
Li, Haoliang ;
Kot, Alex C. ;
Wen, Bihan .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) :10422-10435
[4]  
Dosovitskiy A, 2021, INT C LEARN REPR ICL
[5]  
Finn C, 2017, PR MACH LEARN RES, V70
[6]   Compact Bilinear Pooling [J].
Gao, Yang ;
Beijbom, Oscar ;
Zhang, Ning ;
Darrell, Trevor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326
[7]   A Spectral and Spatial Attention Network for Change Detection in Hyperspectral Images [J].
Gong, Maoguo ;
Jiang, Fenlong ;
Qin, A. K. ;
Liu, Tongfei ;
Zhan, Tao ;
Lu, Di ;
Zheng, Hanhong ;
Zhang, Mingyang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[8]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[9]   TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization With Few Labeled Samples [J].
Huang, Huaxi ;
Zhang, Junjie ;
Yu, Litao ;
Zhang, Jian ;
Wu, Qiang ;
Xu, Chang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) :853-866
[10]   Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification [J].
Huang, Huaxi ;
Zhang, Junjie ;
Zhang, Jian ;
Xu, Jingsong ;
Wu, Qiang .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :1666-1680