Bi-channel attention meta learning for few-shot fine-grained image recognition

被引:10
作者
Wang, Yao [1 ,2 ]
Ji, Yang [1 ]
Wang, Wei [1 ]
Wang, Bailing [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264209, Peoples R China
[2] Harbin Inst Technol, Cyberspace Secur Inst, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot learning; Fine-grained image recognition; Meta-learning; Visual attention; NETWORK;
D O I
10.1016/j.eswa.2023.122741
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot fine-grained recognition is an attractive research topic that aims to differentiate between subcategories using a limited number of labeled examples. Due to the characteristics of fine-grained images, capturing subtle differences between categories using limited samples is very challenging. Discriminative information is essential for fine-grained image recognition, however, existing methods of few-shot learning usually extract features from each part indiscriminately, resulting in poor performance. To solve this problem, this work presents a compact Bi-channel Attention Meta-learning Model with an embedding module and a feature calibration module. The embedding module can effectively prevent the loss of crucial spatial information, thereby learning better deep descriptors. The feature calibration module consists of two sequentially arranged channel attention blocks, which allow the network selectively enhances discriminative features and compress less useful features with global information. Experiments on three commonly used fine-grained benchmark datasets indicate the efficacy and superiority of the proposed model.
引用
收藏
页数:7
相关论文
共 40 条
[1]   LaSO: Label-Set Operations networks for multi-label few-shot learning [J].
Alfassy, Amit ;
Karlinsky, Leonid ;
Aides, Amit ;
Shtok, Joseph ;
Harary, Sivan ;
Feris, Rogerio ;
Giryes, Raja ;
Bronstein, Alex M. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6541-6550
[2]  
Antoniou A, 2019, Arxiv, DOI arXiv:1810.09502
[3]  
Branson S, 2014, Arxiv, DOI [arXiv:1406.2952, DOI 10.48550/ARXIV.1406.2952, 10.48550/arXiv.1406.2952]
[4]   Symbiotic Segmentation and Part Localization for Fine-Grained Categorization [J].
Chai, Yuning ;
Lempitsky, Victor ;
Zisserman, Andrew .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :321-328
[5]   Spot and Learn: A Maximum-Entropy Patch Sampler for Few-Shot Image Classification [J].
Chu, Wen-Hsuan ;
Li, Yu-Jhe ;
Chang, Jing-Cheng ;
Wang, Yu-Chiang Frank .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6244-6253
[6]  
Feng H, 2018, Arxiv, DOI arXiv:1808.03935
[7]  
Finn C, 2017, PR MACH LEARN RES, V70
[8]   Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition [J].
Fu, Jianlong ;
Zheng, Heliang ;
Mei, Tao .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4476-4484
[9]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/CVPR.2018.00745, 10.1109/TPAMI.2019.2913372]
[10]   TOAN: Target-Oriented Alignment Network for Fine-Grained Image Categorization With Few Labeled Samples [J].
Huang, Huaxi ;
Zhang, Junjie ;
Yu, Litao ;
Zhang, Jian ;
Wu, Qiang ;
Xu, Chang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (02) :853-866