Fine-grained Image Classification by Integrating Object Localization and Heterogeneous Local Interactive Learning

被引:0
作者
Chen, Quan [1 ]
Chen, Fei [1 ]
Wang, Yan-Gen [1 ]
Cheng, Hang [2 ]
Wang, Mei-Qing [2 ]
机构
[1] College of Computer and Data Science, Fuzhou University, Fuzhou
[2] School of Mathematics and Statistics, Fuzhou University, Fuzhou
来源
Zidonghua Xuebao/Acta Automatica Sinica | 2024年 / 50卷 / 11期
基金
中国国家自然科学基金;
关键词
Deep learning; fine-grained image classification; graph neural network; knowledge distillation; weakly supervised object localization;
D O I
10.16383/j.aas.c230507
中图分类号
学科分类号
摘要
Due to the existence of small inter-class differences and large intra-class variance among fine-grained images, the existing classification algorithms only focus on the extraction and representation learning of salient local features of a single image, ignoring the local heterogeneous semantic discrimination information between multiple images, difficult to pay attention to the subtle details that distinguish different categories, resulting in the lack of sufficient discrimination of the learned features. This paper proposes a progressive network to learn the information of different granularity levels of the image in a weakly supervised manner. First, attention accumulation object localization module (AAOLM) is constructed to perform semantic target integration localization on attention information from different training epochs and feature extraction stages on a single image. Second, a multi-image heterogeneous local interactive graph module (HLIGM) is designed to construct a graph network and aggregate information between the local region features of multiple images under the guidance of the category label after extracting the salient local region features of each image to enhance the discriminative power of the representation. Finally, the optimization information generated by HLIGM is fed back to the backbone by using knowledge distillation so that it can directly extract features with strong discrimination, avoiding the computational overhead of building the graph in the test phase. Through experiments on multiple data sets, it proves the effectiveness of the proposed method, which can improve the fine-grained classification accuracy. © 2024 Science Press. All rights reserved.
引用
收藏
页码:2219 / 2230
页数:11
相关论文
共 56 条
[1]  
Luo Jian-Hao, Wu Jian-Xin, A survey on fine-grained image categorization using deep convolutional features, Acta Automatica Sinica, 43, 8, pp. 1306-1318, (2017)
[2]  
Chen Jun-Ying, Chen Ying, Saliency enhanced hierarchical bilinear pooling for fine-grained classification, Journal of Computer-Aided Design & Computer Graphics, 33, 2, pp. 241-249, (2021)
[3]  
Liu D C, Zhao L J, Wang Y, Kato J., Learn from each other to classify better: Cross-layer mutual attention learning for fine-grained visual classification, Pattern Recognition, 140, (2023)
[4]  
Song Y, Sebe N, Wang W., On the eigenvalues of global covariance pooling for fine-grained visual recognition, IEEE Transactions on Pattern Analysis and Machine Intelligence, 45, 3, pp. 3554-3566, (2023)
[5]  
Chou P Y, Kao Y Y, Lin C H., Fine-grained visual classification with high-temperature refinement and background suppression, (2023)
[6]  
Fu J L, Zheng H L, Mei T., Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4476-4484, (2017)
[7]  
Nie X, Chai B S, Wang L Y, Liao Q Y, Xu M., Learning enhanced features and inferring twice for fine-grained image classification, Multimedia Tools and Applications, 82, 10, pp. 14799-14813, (2023)
[8]  
Zheng S J, Wang G C, Yuan Y J, Huang S Q., Fine-grained image classification based on TinyVit object location and graph convolution network, Journal of Visual Communication and Image Representation, 100, (2024)
[9]  
Hu X B, Zhu S N, Peng T L., Hierarchical attention vision transformer for fine-grained visual classification, Journal of Visual Communication and Image Representation, 91, (2023)
[10]  
Zheng H L, Fu J L, Mei T, Luo J B., Learning multi-attention convolutional neural network for fine-grained image recognition, Proceedings of the IEEE International Conference on Computer Vision, pp. 5219-5227, (2017)