Label smoothing and task-adaptive loss function based on prototype network for few-shot learning

Times cited: 14
Authors
Gao, Farong [1]
Luo, Xingsheng [1]
Yang, Zhangyi [1]
Zhang, Qizhong [1, 2]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
[2] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
Keywords
Flexible hyperparameters; Improved loss function; Few-shot learning; Image classification; Deep learning; CLASSIFICATION;
DOI
10.1016/j.neunet.2022.09.018
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Prototype networks have two problems: the label information is not reliable enough, and the hyperparameters of the loss function cannot follow changes in image feature information. To address both, we propose a method that combines label smoothing with flexible hyperparameters. First, the label information of an image is processed by label smoothing regularization. Then, for each classification task, the distance matrix of the image features is fused with the hyperparameters of the loss function through a logarithmic operation. Finally, the hyperparameters are associated with the smoothed labels and the distance matrix for predictive classification. The method is validated on the miniImageNet, FC100 and tieredImageNet datasets. Compared with methods that use unsmoothed labels and fixed hyperparameters, the flexible hyperparameters in the loss function improve classification accuracy under few-shot conditions by 2%-3%. The results show that the proposed method can suppress the interference of false labels and that flexible hyperparameters can improve classification accuracy. (c) 2022 Elsevier Ltd. All rights reserved.
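The three steps in the abstract (smooth the labels, derive a task-dependent hyperparameter from the distance matrix, classify with both) can be illustrated with a minimal NumPy sketch. The abstract does not give the paper's formula, so everything specific here is an assumption: the function names, the smoothing factor `eps=0.1`, the use of squared Euclidean distance to class prototypes, and in particular the hypothetical `task_scale` rule, which merely shows one way a logarithm of the distance matrix could set a per-task scale. It is not the authors' loss function.

```python
import numpy as np

def smooth_labels(labels, num_classes, eps=0.1):
    """Label smoothing: mix one-hot targets with a uniform distribution."""
    one_hot = np.eye(num_classes)[labels]
    return one_hot * (1.0 - eps) + eps / num_classes

def class_prototypes(support, support_labels, num_classes):
    """Prototype of each class = mean of its support embeddings."""
    return np.stack([support[support_labels == c].mean(axis=0)
                     for c in range(num_classes)])

def squared_distances(queries, protos):
    """Distance matrix of shape (num_queries, num_classes)."""
    diff = queries[:, None, :] - protos[None, :, :]
    return (diff ** 2).sum(axis=-1)

def task_scale(dist):
    """HYPOTHETICAL task-adaptive hyperparameter (not from the paper):
    a scale that shrinks as the log of the mean distance grows."""
    return 1.0 / (np.log1p(dist.mean()) + 1e-8)

def smoothed_proto_loss(queries, query_labels, support, support_labels,
                        num_classes, eps=0.1):
    protos = class_prototypes(support, support_labels, num_classes)
    dist = squared_distances(queries, protos)
    # Closer prototypes get larger logits; the scale depends on the task.
    logits = -task_scale(dist) * dist
    # Numerically stable log-softmax over classes.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Cross-entropy against the smoothed targets.
    targets = smooth_labels(query_labels, num_classes, eps)
    loss = -(targets * log_probs).sum(axis=1).mean()
    return loss, log_probs.argmax(axis=1)

# Toy 2-way, 2-shot task with 2-D embeddings (made-up numbers).
support = np.array([[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [5.0, 6.0]])
support_labels = np.array([0, 0, 1, 1])
queries = np.array([[0.0, 0.5], [5.0, 5.5]])
query_labels = np.array([0, 1])

loss, preds = smoothed_proto_loss(queries, query_labels,
                                  support, support_labels, num_classes=2)
```

Because the smoothed targets keep a small probability mass on the wrong classes, the loss stays above zero even when every query is classified correctly, which is the regularizing effect label smoothing is meant to have.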
Pages: 39-48
Number of pages: 10