Label-Embedding Bi-directional Attentive Model for Multi-label Text Classification

被引:37
作者
Liu, Naiyin [1 ]
Wang, Qianlong [1 ]
Ren, Jiangtao [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangdong Prov Key Lab Computat Sci, Guangzhou 510275, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label text classification; BERT; Label embedding; Bi-directional attention;
D O I
10.1007/s11063-020-10411-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label text classification is a critical task in natural language processing field. As the latest language representation model, BERT obtains new state-of-the-art results in the classification task. Nevertheless, the text classification framework of BERT neglects to make full use of the token-level text representation and label embedding, since it only utilizes the final hidden state corresponding to CLS token as sequence-level text representation for classification. We assume that the finer-grained token-level text representation and label embedding contribute to classification. Consequently, in this paper, we propose a Label-Embedding Bi-directional Attentive model to improve the performance of BERT's text classification framework. In particular, we extend BERT's text classification framework with label embedding and bi-directional attention. Experimental results on the five datasets indicate that our model has notable improvements over both baselines and state-of-the-art models.
引用
收藏
页码:375 / 389
页数:15
相关论文
共 28 条
[21]  
Vaswani A, 2017, ADV NEUR IN, V30
[22]   A Pipeline for Optimizing F1-Measure in Multi-Label Text Classification [J].
Wang, Bingyu ;
Li, Cheng ;
Pavlu, Virgil ;
Aslam, Jay .
2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, :913-918
[23]  
Wang GY, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P2321
[24]  
Wu Y., 2016, GOOGLES NEURAL MACHI
[25]  
Yang Pengcheng, 2018, P 27 INT C COMP LING, P3915
[26]  
Yang WY, 2018, DESTECH TRANS COMP, P545
[27]   ML-KNN: A lazy learning approach to multi-label leaming [J].
Zhang, Min-Ling ;
Zhou, Zhi-Hua .
PATTERN RECOGNITION, 2007, 40 (07) :2038-2048
[28]   Multilabel neural networks with applications to functional genomics and text categorization [J].
Zhang, Min-Ling ;
Zhou, Zhi-Hua .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2006, 18 (10) :1338-1351