Few-Shot Image Classification Based on Multi-Scale Label Propagation

被引:0
作者
Wang H. [1 ]
Tian S. [1 ]
Tang Q. [2 ]
Chen D. [1 ,3 ,4 ]
机构
[1] Big Data Research Center, University of Electronic Science and Technology of China, Chengdu
[2] Communication and Information Technology Center, Petro China Southwest Oil and Gas Company, Chengdu
[3] Chengdu Union Big Data Technology Incorporated, Chengdu
[4] The Key Research Base of Digital Culture and Media of Sichuan Provincial Social Science, Chengdu
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2022年 / 59卷 / 07期
基金
中国国家自然科学基金;
关键词
Feature fusion; Few-shot learning (FSL); Label propagation; Metric learning; Multi-scale feature;
D O I
10.7544/issn1000-1239.20210376
中图分类号
学科分类号
摘要
Under the condition of few-shot, due to the problem of low data, in other words, the labeled data is rare and difficult to gather, it is very difficult to train a good classifier by traditional deep learning. In recent researches, the method based on measuring low level local information and TPN(transductive propagation network)has achieved good classification results. Moreover, local information can measure the relation between features well, but the problem of low data still exists. In order to solve the issue of low data, MSLPN (multi-scale label propagation network)based on TPN is proposed in this paper. The core idea of the method is to use a multi-scale generator to generate image features of multiple scales, obtain the similarity scores of samples with different scale features through the relational measurement module, and obtain classification results by integrating similarity scores at different scales. Specifically, the method firstly generates multiple image features of different scales through a multi-scale generator. And then, the similarity scores of the multi-scale information are used for label propagation. Finally, classification results are obtained by calculating the multi-scale label propagation results. Compared with TPN, in miniImageNet, the classification accuracy of 5-way 1-shot and 5-way 5-shot settings is increased by 2.77% and 4.02% respectively. While in tieredImageNet, the classification accuracy of 5-way 1-shot and 5-way 5-shot settings is increased by 1.16% and 1.27% respectively. The experimental results show that the proposed method in this paper can effectively improve the classification accuracy by using multi-scale feature information. © 2022, Science Press. All right reserved.
引用
收藏
页码:1486 / 1495
页数:9
相关论文
共 26 条
[1]  
LeCun Y, Bengio Y, Hinton G., Deep learning, Nature, 521, 7553, pp. 436-444, (2015)
[2]  
Krizhevsky A, Sutskever I, Hinton G E., ImageNet classification with deep convolutional neural networks, Proc of the 26th Annual Conf on Neural Information Processing Systems, pp. 1097-1105, (2012)
[3]  
Hinton G, Li Deng, Dong Yu, Et al., Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, IEEE Signal Processing Magazine, 29, 6, pp. 82-97, (2012)
[4]  
Alexis C, Holger S, LeCun Y, Et al., Very deep convolutional networks for text classification, Proc of the 15th European Chapter of the Association for Computational Linguistics, pp. 1107-1116, (2017)
[5]  
Silver D, Huang A, Maddison C, Et al., Mastering the game of go with deep neural networks and tree search, Nature, 529, 7587, pp. 484-489, (2016)
[6]  
Russakovsky O, Deng Jia, Su Hao, Et al., ImageNet large scale visual recognition challenge, International Journal of Computer Vision, 115, 3, pp. 211-252, (2015)
[7]  
Everingham M, Van Gool L, Williams C, Et al., The pascal visual object classes (VOC) challenge, International Journal of Computer Vision, 88, 2, pp. 303-338, (2010)
[8]  
Wang Hang, Chen Xiao, Tian Shengzhao, Et al., SAR image recognition based on few-shot learning, Computer Science, 47, 5, pp. 124-128, (2020)
[9]  
Koch G R, Zemel R, Salakhutdinov R., Siamese neural networks for one-shot image recognition [C/OL], Proc of the 32nd Int Conf on Machine Learning, (2015)
[10]  
Vinyals O, Blundell C, Lillicrap T, Et al., Matching networks for one shot learning, Proc of the 30th Annual Conf on Neural Information Processing Systems, pp. 3630-3638, (2016)