Learning Category Distribution for Text Classification

被引:0
作者
Wang, Xiangyu [1 ]
Zong, Chengqing [1 ]
机构
[1] Univ Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Chinese Acad Sci,Sch Artificial Intelligence, Beijing 100049, Peoples R China
关键词
Category distribution; text classification; neural networks;
D O I
10.1145/3585279
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Label smoothing has a wide range of applications in the machine learning field. Nonetheless, label smoothing only softens the targets by adding a uniform distribution into a one-hot vector, which cannot truthfully reflect the underlying relations among categories. However, learning category relations is of vital importance in many fields such as emotion taxonomy and open set recognition. In this work, we propose a method to obtain the label distribution for each category (category distribution) to reveal category relations. Furthermore, based on the learned category distribution, we calculate new soft targets to improve the performance of model classification. Compared with existing methods, our algorithm can improve neural network models without any side information or additional neural network module by considering category relations. Extensive experiments have been conducted on four original datasets and 10 constructed noisy datasets with three basic neural network models to validate our algorithm. The results demonstrate the effectiveness of our algorithm on the classification task. In addition, three experiments (arrangement, clustering, and similarity) are also conducted to validate the intrinsic quality of the learned category distribution. The results indicate that the learned category distribution can well express underlying relations among categories.
引用
收藏
页数:13
相关论文
共 48 条
[1]  
Abernethy Jacob., 2008, AIRWeb '08, P41, DOI DOI 10.1145/1451983.1451994
[2]   Label-Embedding for Image Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) :1425-1438
[3]   Label-Embedding for Attribute-Based Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :819-826
[4]  
Chen C, 2019, AAAI CONF ARTIF INTE, P3304
[5]   Towards better decoding and language model integration in sequence to sequence models [J].
Chorowski, Jan ;
Jaitly, Navdeep .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :523-527
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]  
Ekman P., 1994, NATURE EMOTION FUNDA
[8]  
Forsyth R. S., 1996, Literary & Linguistic Computing, V11, P163, DOI 10.1093/llc/11.4.163
[9]   Recent Advances in Open Set Recognition: A Survey [J].
Geng, Chuanxing ;
Huang, Sheng-Jun ;
Chen, Songcan .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) :3614-3631
[10]   Label Distribution Learning [J].
Geng, Xin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (07) :1734-1748