Deep label embedding learning for classification

被引：0

作者：

Nousi, Paraskevi ^{[1
]}

Tefas, Anastasios ^{[1
]}

机构：

[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki, Greece

来源：

APPLIED SOFT COMPUTING | 2024年 / 163卷

关键词：

Label embedding; Soft labels; Class similarities; Instance similarities;

D O I：

10.1016/j.asoc.2024.111925

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The one-hot 0/1 encoding method is the most popularized encoding method of class labels for classification tasks. Despite its simplicity and popularity, it comes with limitations and weaknesses, like failing to capture the inherent uncertainty in data labels, and making classifiers more prone to overfitting. In this paper, these shortcomings are tackled with a framework for learning soft label embeddings. Two variants are proposed: first, a learnable general-class embedding which aims to capture information regarding inter-class similarities, and second, a neural architecture which can be added to any neural classifier and aims to learn inter-instance similarities. The inherent uncertainty in data labels is thus somewhat alleviated, allowing the network to focus on incorrectly classified samples, instead of difficult but correctly classified ones. The experimental study on multiple classification benchmarks of increasing difficulty, using neural networks of varying depth and width, show that the proposed method leads to better classification accuracy, highlighting its ability to generalize to unseen samples.

引用

页数：11

共 41 条

[1] Algan G, 2021, Arxiv, DOI arXiv:2103.10869
[2] Chen BR, 2020, Arxiv, DOI [arXiv:2010.12648, 10.48550/arXiv.2010.12648]
[3] Chen C, 2019, AAAI CONF ARTIF INTE, P3304
[4] Coates A., 2011, P 14 INT C ART INT S, P215
[5] Coates Adam, 2011, NIPS WORKSHOP DEEP L
[6] Soft Labels for Ordinal Regression
Diaz, Raul
Marathe, Amit
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4733 - 4742
[7] A Fusion Model-Based Label Embedding and Self-Interaction Attention for Text Classification
Dong, Yanru
Liu, Peiyu
Zhu, Zhenfang
Wang, Qicai
Zhang, Qiuyue
[J]. IEEE ACCESS, 2020, 8 : 30548 - 30559
[8] El Gayar N, 2006, LECT NOTES ARTIF INT, V4087, P67
[9] Fayek HM, 2016, IEEE IJCNN, P566, DOI 10.1109/IJCNN.2016.7727250
[10] Label smoothing and task-adaptive loss function based on prototype network for few-shot learning
Gao, Farong
Luo, Xingsheng
Yang, Zhangyi
Zhang, Qizhong
[J]. NEURAL NETWORKS, 2022, 156 : 39 - 48

← 1 2 3 4 5 →