Deep label embedding learning for classification

被引:0
作者
Nousi, Paraskevi [1 ]
Tefas, Anastasios [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki, Greece
关键词
Label embedding; Soft labels; Class similarities; Instance similarities;
D O I
10.1016/j.asoc.2024.111925
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The one-hot 0/1 encoding method is the most popularized encoding method of class labels for classification tasks. Despite its simplicity and popularity, it comes with limitations and weaknesses, like failing to capture the inherent uncertainty in data labels, and making classifiers more prone to overfitting. In this paper, these shortcomings are tackled with a framework for learning soft label embeddings. Two variants are proposed: first, a learnable general-class embedding which aims to capture information regarding inter-class similarities, and second, a neural architecture which can be added to any neural classifier and aims to learn inter-instance similarities. The inherent uncertainty in data labels is thus somewhat alleviated, allowing the network to focus on incorrectly classified samples, instead of difficult but correctly classified ones. The experimental study on multiple classification benchmarks of increasing difficulty, using neural networks of varying depth and width, show that the proposed method leads to better classification accuracy, highlighting its ability to generalize to unseen samples.
引用
收藏
页数:11
相关论文
共 41 条
  • [1] Algan G, 2021, Arxiv, DOI arXiv:2103.10869
  • [2] Chen BR, 2020, Arxiv, DOI [arXiv:2010.12648, 10.48550/arXiv.2010.12648]
  • [3] Chen C, 2019, AAAI CONF ARTIF INTE, P3304
  • [4] Coates A., 2011, P 14 INT C ART INT S, P215
  • [5] Coates Adam, 2011, NIPS WORKSHOP DEEP L
  • [6] Soft Labels for Ordinal Regression
    Diaz, Raul
    Marathe, Amit
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4733 - 4742
  • [7] A Fusion Model-Based Label Embedding and Self-Interaction Attention for Text Classification
    Dong, Yanru
    Liu, Peiyu
    Zhu, Zhenfang
    Wang, Qicai
    Zhang, Qiuyue
    [J]. IEEE ACCESS, 2020, 8 : 30548 - 30559
  • [8] El Gayar N, 2006, LECT NOTES ARTIF INT, V4087, P67
  • [9] Fayek HM, 2016, IEEE IJCNN, P566, DOI 10.1109/IJCNN.2016.7727250
  • [10] Label smoothing and task-adaptive loss function based on prototype network for few-shot learning
    Gao, Farong
    Luo, Xingsheng
    Yang, Zhangyi
    Zhang, Qizhong
    [J]. NEURAL NETWORKS, 2022, 156 : 39 - 48