Fine-grained Image Classification by Integrating Object Localization and Heterogeneous Local Interactive Learning

Cited by: 0
Authors
Chen, Quan [1]
Chen, Fei [1]
Wang, Yan-Gen [1]
Cheng, Hang [2]
Wang, Mei-Qing [2]
Affiliations
[1] College of Computer and Data Science, Fuzhou University, Fuzhou
[2] School of Mathematics and Statistics, Fuzhou University, Fuzhou
Source
Zidonghua Xuebao/Acta Automatica Sinica | 2024, Vol. 50, No. 11
Funding
National Natural Science Foundation of China
Keywords
Deep learning; fine-grained image classification; graph neural network; knowledge distillation; weakly supervised object localization;
DOI
10.16383/j.aas.c230507
Abstract
Fine-grained images exhibit small inter-class differences and large intra-class variance. Existing classification algorithms focus only on extracting and learning representations of salient local features from a single image and ignore the heterogeneous local semantic discriminative information shared across multiple images. As a result, they struggle to attend to the subtle details that distinguish categories, and the learned features lack sufficient discriminative power. This paper proposes a progressive network that learns information at different granularity levels of an image in a weakly supervised manner. First, an attention accumulation object localization module (AAOLM) is constructed to integrate attention information from different training epochs and feature extraction stages of a single image and to localize the semantic target. Second, a multi-image heterogeneous local interactive graph module (HLIGM) is designed: after the salient local region features of each image are extracted, it builds a graph network and aggregates information among the local region features of multiple images under the guidance of category labels, enhancing the discriminative power of the representation. Finally, the optimization information generated by HLIGM is fed back to the backbone through knowledge distillation, so that the backbone directly extracts strongly discriminative features and the computational overhead of building the graph is avoided in the test phase. Experiments on multiple datasets demonstrate the effectiveness of the proposed method and show that it improves fine-grained classification accuracy. © 2024 Science Press. All rights reserved.
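The distillation step described in the abstract can be illustrated with a generic temperature-scaled distillation loss. This is only a minimal sketch of standard knowledge distillation, not the loss used in the paper: the function names, the temperature value, and the choice of KL divergence between softened teacher outputs (standing in for the HLIGM-refined predictions) and student outputs (standing in for the backbone predictions) are all assumptions made for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits (numerically stable)."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Hypothetical helper: the teacher stands in for the graph-refined branch,
    the student for the backbone. The paper's actual objective may differ.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    kl = sum(
        t * math.log(t / s)
        for t, s in zip(p_teacher, p_student)
        if t > 0.0
    )
    return kl * temperature ** 2


# Identical predictions give zero loss; diverging predictions are penalized.
same = distillation_loss([1.0, 2.0, 3.0], [1.0, 2.0, 3.0])
different = distillation_loss([3.0, 2.0, 1.0], [1.0, 2.0, 3.0])
```

Under this kind of objective only the student branch is needed at inference, which is consistent with the abstract's point that graph construction is avoided in the test phase.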
Pages: 2219-2230
Number of pages: 11